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This  program  is  for  follow-on  research  efforts  for  the  participants  in  the  Summer  Faculty  Research  Program?^ 
Funding  is  provided  to  establish  RIP  awards  to  about  half  the  number  of  participants  in  the  SFRP.  Participants 
in  the  1989  SFRP  competed  for  funding  under  the  1989  RIP.  Evaluation  of  the  proposals  were  made  by  the 
contractor.  Evaluation  criteria  consisted  of:  1.  Technical  excellence  of  the  proposal  2.  Contbuation  of  the 
SFRP  effort  3.  Cost  sharing  by  the  university.  The  list  of  proposals  selected  for  award  was  forwarded  to 
AFOSR  for  approval  of  fimdbg  and  for  research  efforts  to  be  completed  by  31  December  1990,  The  following 
summarizes  the  events  for  the  evaluation  of  proposals  and  award  of  funding  under  the  RIP.  A.  RIP  proposals 
were  submitted  to  the  contractor  by  1  November  1990.  The  proposals  were  limited  to  $20,000  plus  cost  «haring 
by  the  universities.  The  universities  were  encouraged  to  cost  share,  sbce  this  is  an  effort  to  establish  a  long 
term  effort  between  the  Air  Force  and  the  university.  B.  Proposals  were  evaluated  on  the  criteria  listed  above 
and  the  final  award  approval  was  given  by  AFOSR  after  consultation  with  the  Air  Force  Laboratories.  C 
Subcontracts  were  negotiated  with  the  Universities.  There  were  a  total  of  122  RIP  awards  made  under  the  1989 
program. 
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INTRODUCTION 
Research  Initiation  Program  -  1989 

AFOSR  has  provided  funding  for  follow-on  research  efforts  for  the  participants  in  the 
Summer  Faculty  Research  Program.  Initially,  this  program  was  conducted  by  AFOSR  and 
popularly  known  as  the  Mini-Grant  Program.  Since  1983  the  program  has  been  conducted  by 
the  Summer  Faculty  Research  Program  (SFRP)  contractor  and  is  now  called  the  Research 
Initiation  Program  (RIP).  Funding  is  provided  to  establish  RIP  awards  to  about  half  the  number 
of  participants  in  the  SFRP. 

Participants  in  the  1989  SFRP  competed  for  funding  under  the  1989  RIP.  Participants 
submitted  cost  and  technical  proposals  to  the  contractor  by  1  November  1989,  following  their 
participation  in  the  1989  SFRP. 

Evaluation  of  these  proposals  were  made  by  the  contractor.  Evaluation  criteria  consisted 
of: 


1.  Technical  excellence  of  the  proposal 

2.  Continuation  of  the  SFRP  effort 

3.  Cost  sharing  by  the  university 

The  list  of  proposals  selected  for  award  was  forwarded  to  AFOSR  for  approval  of  funding. 
Those  approved  by  AFOSR  were  funded  for  research  efforts  to  be  completed  by  31  December 
1990. 

The  following  summarizes  the  events  for  the  evaluation  of  proposals  and  award  of  funding 
under  the  RIP. 

A.  RIP  proposals  were  submitted  to  the  contractor  by  1  November  1989.  The 
proposals  were  limited  to  $20,000  plus  cost  sharing  by  the  universities.  The 
universities  were  encouraged  to  cost  share,  since  this  is  an  effon  to  establish  a  long 
term  effort  between  the  Air  Force  and  the  university. 

B.  Proposals  were  evaluated  on  the  criteria  listed  above  and  the  final  award  approval 
was  given  by  AFOSR  after  consultation  with  the  Air  Force  Laboratories. 

C.  Subcontracts  were  negotiated  with  the  universities.  The  period  of  performance  of 
the  subcontract  was  between  October  1989  and  December  1990. 

Copies  of  the  final  reports  are  presented  in  Volumes  I  through  IV  of  the  1989  Research 
Initiation  Program  Report.  There  were  a  .al  of  122  RIP  awards  made  under  the  1989  program. 
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PROGRAM  STATISTICS 


Total  SFRP  Participants  168 

Total  RIP  Proposals  submitted  by  SFRP  132 

Total  RIP  Proposals  submitted  by  GSRP  2 

Total  RIP  Proposals  submitted  134 

Total  rip’s  funded  to  SFRP  94 

Total  RIP’S  funded  to  GSRP  2 

Total  RIP’S  funded  96 

Total  RIP  Proposals  submitted  by  HBCU’s  9 

Total  RIP  Proposals  funded  to  HBCU’s  5 


IV 


LABORATORY  PARTICIPATION 


1 

Particioants 

Submitted 

Funded 

AAMRL 

12 

10 

6 

WRDOAPL 

10 

8 

6 

ATL 

9 

9  (1  GSRP) 

9  (1  GSRP) 

AEDC 

10 

8 

8 

WRDOAL 

7 

5 

4 

ESMC 

0 

0 

0 

ESD 

3 

2 

1 

ESC 

11 

8 

7 

WRDC/FDL 

9 

7 

5 

FJSRL 

7 

5 

4 

AFGL 

12 

10 

6 

HRL 

12 

10  (1  GSRP) 

8  (1  GSRP) 

WRDC/ML 

9 

7 

5 

OEHL 

4 

1 

1 

AL 

12 

10 

6 

RADC 

15 

11 

8 

SAM 

17 

16 

9 

WL 

8 

7 

3 

WHMC 

1 

0 

0 

Total 

168 

134 

96 
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LIST  OF  PARTICIPATING  UNIVERSITIES 
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Alabama,  University  of 

- 

1 

New  York,  State  University  of 

- 

2 

Alfred  University 

- 

1 

North  Carolina  State  University 

- 

1 

Arkansas-Pine  Bluff,  Univ.  of 

- 

1 

Northern  Arizona  University 

- 

1 

Auburn  University 

- 

1 

Northern  Illinois  University 

- 

1 

Bethel  College 

- 

1 

Northwestern  University 

• 

1 

Boston  College 

- 

1 

Notre  Dame,  University  of 

- 

1 

Brescia  College 

- 

1 

Ohio  State  University 

- 

2 

California  Polytechnic 

- 

1 

Oklahoma,  University  of 

- 

3 

California  State  University 

- 

2 

Old  Dominion  University 

- 

1 

Cincinnati,  University  of 

- 

2 

Pennsylvania  State  University 

- 

1 

Denver,  University  of 

- 

1 

Pittsburgh,  University  of 

- 

1 

Eastern  Kentucky  University 

- 

1 

Rhode  Island,  University  of 

- 

1 

Florida  Atlantic  University 

- 

1 

San  Diego  State  University 

- 

1 

Florida  Institute 

- 

1 

San  Jose  State  University 

- 

1 

Florida,  University  of 

- 

4 

Savannah  State  College 

- 

1 

Hamilton  College 

- 

1 

Scranton,  University  of 

- 

1 

Harvard  University 

- 

1 

Southern  Oregon  State  College 

- 

2 

Illinois  Institute  of  Technology 

- 

1 

Southwest  Texas  State  University 

- 

1 

Illinois-Rockford,  University  of 

- 

1 

Tennessee  State  University 

- 

1 

Illinois  State  University 

- 

1 

Tennessee  Technological  Univ. 

- 

1 

Indiana-Purdue,  University  of 

- 

1 

Texas  A&M  University 

- 

6 

Kansas  State  University 

- 

2 

Texas  Southern  University 

- 

1 

Lawrence  Technological  University 

- 

1 

TexaS'San  Antonio,  University  of 

- 

3 

Long  Island  University 

- 

1 

Transylvania  University 

- 

1 

Lowell,  University  of 

- 

1 

Trinity  University 

- 

1 

Massachusetts,  University  of 

- 

2 

US  Naval  Academy 

- 

1 

Michigan,  University  of 

- 

1 

Utah  State  University 

- 

1 

Minnesota-Duluth,  University  of 

- 

2 

Utica  College 

- 

1 

Mississippi  State  University 

- 

2 

Vanderbilt  University 

- 

1 

Missouri-Rolla,  University  of 

- 

1 

Washington  State  University 

- 

1 

Murray  State  University 

- 

1 

West  Virginia  University 

- 

2 

Nebraska-Lincoln,  University  of 

- 

2 

Wisconsin-Platteville,  Univ.  of 

- 

1 

New  Hampshire,  University  of 

- 

1 

Worchester  Polytechnic  Institute 

- 

1 

New  York  Institute  of  Technology 

- 

1 

Wright  State  University 

- 

6 

Total  -  94 
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PARTICIPANTS  LABORATORY  ASSIGNMENT 


AERO  PROPULSION  AND  POWER  DIRECTORATE 
(Wright-Patterson  Air  Force  Base) 


Dr.  Jerry  Clark 
Wright  State  University 
Specialty:  Physics 

Dr.  Frank  Gemer 
University  of  Cincinnati 
Specialty:  Mechanical  Engineering 

Dr.  Thomas  Lalk 
Texas  A&M  University 
Specialty:  Mechanical  Engineering 


ARMAMENT  DIRECTORATE 
(Eglin  Air  Force  Base) 

Dr.  Peter  Armendarez 

Brescia  College 

Specialty:  Physical  Chemistry 

Dr.  Joseph  Brown 
Mississippi  State  Uniyersity 
Specialty:  Mechanical  Engineering 

Dr.  Roger  Bunting 
Illinois  State  Uniyersity 
Specialty:  Inorganic  Chemistry 

Dr.  Satish  Chandra 
Kansas  State  University 
Specialty:  Electrical  Engineering 


Dr.  Baruch  Lieber 

State  University  of  New  York 

Specialty:  Aerospace  Engineering 

Dr.  William  Schulz 
Eastern  Kentucky  University 
Specialty:  Analytical  Chemistry 
760-7MG-079  and  210-10MG-095 

Dr.  Richard  Tankin 
Northwestern  Uniyersity 
Specialty:  Mechanical  Engineering 


Mr.  William  Newbold  (GSRP) 
Uniyersity  of  Borida 
Specialty:  Aerospace  Engineering 

Dr.  Boghos  Sivazlian 
Uniyersii}'  of  Borida 
Specialty:  Operations  Research 

Dr.  Steyen  Trogdon 
Uniyersity  of  Minnesota-Duluth 
Specialty:  Mechanics 

Mr.  Asad  Yousuf 
Sayannah  State  College 
Specialty:  Electrical  Engineering 


Dr.  Dayid  Cicci 

Auburn  University 

Specialty:  Aerospace  Engineering 


ARMSTRONG  LABORATORY 
(Brooks  Air  Force  Base) 


Dr.  Robert  Blystone 
Trinity  University 
Specialty:  Zoology 

Dr.  Carolyn  Caudle-Alexander 
Tennessee  State  University 
Specialty:  Microbiology 

Dr.  James  Chambers 
University  of  Texas  -  San  Antonio 
Specialty:  Biochemistry 

Dr.  Mark  Cornwall 
Northern  Arizona  University 
Specialty:  Human  Performance 

Dr.  Vito  DelVecchio 

University  of  Scranton 

Specialty:  Biochemical  Engineering 


Dr.  Gwendolyn  Howze 
Texas  Southern  University 
Specialty:  Molecular  Biology 

Dr.  Harold  Longbotham 
University  of  Tcxas-San  Antonio 
Specialty;  Electrical  Engineering 

Dr.  Ralph  Peters  (1987) 

Wichita  State  University 
Specialty:  Zoology 

Dr.  Raymond  Quock 
Univ.  of  Illinois  at  Rockford 
Specialty:  Pharmacology 

Dr.  Ram  Tripathi 

University  of  Texas-San  Antonio 

Specialty:  Statistics 


ARNOLD  ENGINEERING  DEVELOPMENT  CENTER 
(Arnold  Air  Force  Base) 


Dr.  Brian  Beecken 
Bethel  College 
Specialty;  Physics 

Dr.  Stephen  Cobb 
Murray  State  University 
Specialty:  Physics 

Dr.  John  Francis 
University  of  Oklahoma 
Specialty:  Mechanical  Engineering 

Dr.  Orlando  Hankins 
University  of  North  Carolina  State 
Specialty:  Nuclear  Engineering 


Dr.  Lang-Wah  Lee 

University  of  Wisconsin-Platteville 

Specialty:  Mechanical  Engineering 

Dr.  Chun  Fu  Su 
Mississippi  State  University 
Specialty:  Physics 

Dr.  Richard  Tipping 
University  of  Alabama 
Specialty:  Physics 

Dr.  D.  Wilkes 
Vanderbilt  University 
Specialty:  Electrical  Engineering 


IX 


AVIONICS  DIRECTORATE 
(Wright-Patterson  Air  Force  Base) 


Dr.  David  Choate 
Transylvania  University 
Specialty:  Mathematics 

Dr.  R.  H.  Cofer 
Florida  Institute 

Specialty:  Electrical  Engineering 


CREW  SYSTEMS  DIRECTORATE 
(Wright-Patterson  Air  Force  Base) 

Dr.  Thomas  Lockwood 
Wright  State  University 
Specialty:  Toxicology 

Dr.  Ethel  Matin 

Long  Island  University 

Specialty:  Experimental  Psychology 

Dr.  Randy  Pollack 
Wright  State  University 
Specialty:  Anthropology 

Dr.  Donald  Robertson  (1987) 
Indiana  University  of  Pennsylvania 
Specialty:  Psychology 


Dr.  Dar-Biau  Liu 
California  State  University 
Specialty:  Applied  Mathematics 

Dr.  Robert  Shock 
Wright  State  University 
Specialty:  Mathematics 


Dr.  Michael  Stanisic 
University  of  Notre  Dame 
Specialty:  Robotics 

Dr.  Chi-Ming  Tang 

State  University  of  New  York 

Specialty:  Mathematics 

Dr.  Ebo  Tei 

University  of  Arkansas-Pine  Bluff 
Specialty:  Psychology 
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ENGINEERING  AND  SERVICES  CENTER 
(Tyndall  Air  Force  Base) 


Dr.  William  Bannister 
University  of  Lowell 
Specialty:  Organic  Chemistry 

Dr.  Emerson  Besch 
University  of  Florida 
Specialty:  Animal  Physiology 

Dr.  Avery  Demond 
University  of  Massachusetts 
Specialty:  Civil  Engineering 

Dr.  Kirk  Hatfield 
University  of  Florida 
Specialty:  Civil  Engineering 


ELECTRONIC  SYSTEMS  DIVISION 
(Hanscom  Air  Force  Base) 

Dr.  Stephen  Kolitz  (1986) 

University  of  Massachusetts 
Specialty:  Operations  Research 

FLIGHT  DYNAMICS  DIRECTORATE 
(Wright-Patterson  Air  Force  Base) 

Dr.  Kenneth  Cornelius 
Wright  State  University 
Specialty:  Fluid  Mechanics 

Dr.  Arnold  Polak 
University  of  Cincinnati 
Specialty:  Aerospace  Engineering 

Dr.  Nisar  Shaikh 

University  of  Nebraska-Lincoln 

Specialty:  Applied  Mathematics 


Dr.  Kim  Hayes 

University  of  Michigan 

Specialty:  Environmental  Engineering 

Dr.  Deborah  Ross 
University  of  Indiana-Purdue 
Specialty:  Microbiology 

Dr.  Dennis  Truax  (1987) 

Mississippi  State  University 
Specialty:  Civil  Engineering 

Dr.  George  Veyera 
University  of  Rhode  Island 
Specialty:  Civil  Engineering 


Dr.  Sundaram  Natarajan 
Tennessee  Technical  University 
Specialty:  Electrical  Engineering 


Dr.  William  Wolfe 
Ohio  State  University 
Specialty:  Engineering 

Dr.  Lawrence  Zavodney 

Ohio  State  University 

Specialty:  Mechanical  Engineering 
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FRANK  J.  SEILER  RESEARCH  LABORATORY 
(United  States  Air  Force  Academy) 


Dr.  Robert  Granger 
US  Naval  Academy 
Specialty.  Mechanical  Engineering 

Dr.  Clay  Sharts 

San  Diego  State  University 

Specialty:  Chemistry 


GEOPHYSICS  DIRECTORATE 
(Hanscom  Air  Force  Base) 

Dr.  Phanindramohan  Das 
Texas  A&M  University 
Specialty:  Geophysical  Science 

Dr.  Alan  Kafka 
Boston  College 
Specialty:  Geophysics 

Dr.  Charles  Lishawa 
Utica  College 

Specialty:  Physical  Chemistry 


Dr.  Timothy  Troutt 
Washington  State  University 
Specialty:  Mechanical  Engineering 

Dr.  Hung  Vu 

California  State  University 
Specialty:  Applied  Mechanics 


Dr.  Thomas  Miller 
University  of  Oklahoma 
Specialty:  Physics 

Dr.  Henry  Nebel 
Alfred  University 
Specialty:  Physics 

Dr.  Craig  Rasmussen 
Utah  State  University 
Specialty:  Physics 


HUMAN  RESOURCES  DIRECTTORATE 

(Brooks,  Williams  and  Wright-Patterson  Air  Force  Base) 


Dr.  Kevin  Bennett 
Wright  State  University 
Specialty:  Applied  Psychology 

Dr.  Deborah  Mitta 
Texas  A&M  University 
Specialty:  Industrial  Engineering 

Dr.  William  Smith 
University  of  Pittsburgh 
Specialty:  Linguistics 

Dr.  Stanley  Stephenson 
Southwest  Texas  State  University 
Specialty:  Psychology 


Mr.  John  Williamson  (GSRP) 
Texas  A&M  University 
Specialty:  Psychology 

Dr.  Michael  Wolfe 
West  Virginia  University 
Specialty.  Management  Science 

Dr.  Yehoshua  Zleevi 

Harvard  University 

Specialty:  Electrical  Engineering 

Dr.  Robert  Zerwekh 
Northern  Illinois  University 
Specialty:  Philosophy 
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MATERIALS  DIRECTORATE 
(Wright-Patterson  Air  Force  Base) 


Dr.  Donald  Chung 

San  Jose  State  University 

Specialty:  Material  Science 

Dr.  Kenneth  Currie 
Kansas  State  University 
Specialty:  Industrial  Engineering 

Dr.  Michael  Resch 
University  of  Nebraska-Lincoln 
Specialty:  Materials  Science 


Dr.  James  Sherwood 
University  of  New  Hampshire 
Specialty:  Aerospace  Mechanics 
210-9MG-088  and  210-10MG-098 

Dr.  Michael  Sydor 
University  of  Minnesota-Duluth 
Specialty:  Physics 


OCCUPATIONAL  AND  ENVIRONMENTAL  HEALTH  DIRECTORATE 
(Brooks  Air  Force  Base) 

Dr.  Stewart  Maurer 

New  York  Institute  of  Technology 

Specialty:  Electrical  Engineering 


ROCKET  PROPULSION  DIRECTORATE 
(Edwards  Air  Force  Base) 

Dr.  Lynn  Kirms 

Southern  Oregon  State  College 

Specialty:  Organic  Chemistry 

Dr.  Mark  Kirms 

Southern  Oregon  State  College 

Specialty:  Organic  Chemistry 

Dr.  Faysal  Kolkailah 
California  Polytechnic 
Specialty:  Mechanical  Engineering 


Dr.  Vittal  Rao 
University  of  Missouri-Rolla 
Specialty:  Control  Systems 

Dr.  Larry  Swanson 

University  of  Denver 

Specialty:  Mechanical  Engineering 

Dr.  Roger  Thompson 
Pennsylvania  State  University 
Specialty:  Engineering  Mechanics 


Xlll 


ROME  LABORATORffiS 
(Griffiss  Air  Force  Base) 

Dr.  Charles  Alajajian 

West  Virginia  University 

Soecialtv;  Electrical  Encineerine 

Dr.  Khaja  Subhani 

Lawrence  Tech.  University 
£  jcialtv:  Electrical  Encineerine 

Dr.  Ian  Crosse 

University  of  Massachusetts 

Soecialtv:  Mechanical  Encineerine 

Dr.  David  Sumberg  (1987) 
Rochester  Institute  of  Tech. 
Soecialtv:  Phvsics 

Dr.  Henry  Helmken 

Florida  Atlantic  University 

Soecialtv:  Phvsics 

Dr.  Donald  Ucci 

Illinois  Institute  of  Technology 
Soecialtv:  Electrical  Encineerine 

Dr.  Michael  Klein 

Worcester  Poly  Institute 

Soecialtv:  Phvsics 

Dr.  Kenneth  Walter  (1988) 
Prairie  View  A&M  University 
Soecialtv:  Chemical  Encineerine 

Dr.  William  Kuriger 

University  of  Oklahoma 

Soecialtv:  Electrical  Encineerine 

Dr.  James  Wolper 

Hamilton  College 

Soecialtv:  Mathematics 

WEAPONS  DIRECTORATE 
(Kiitland  Air  Force  Base) 

Dr.  Harry  Hogan 

Texas  A&M  University 

Soecialtv:  Mechanical  Encineerine 

Dr.  Due  Nguyen 

Old  Dominion  University 
Soecialtv:  Civil  Encineerine 

Dr.  Arkady  Kheyfets  (1988) 

North  Carolina  State  University 

Specialty:  Mathematical  Phvsics 

Dr.  Duane  Sanders 

Texas  A&M  University 

Specialty:  Civil  Engineering 
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MINI-GRANT  RESEARCH  REPORTS 


Technical 

Report 

Number  Title  and  Mini-Grant  Number 

Volume  I 

Rome  Laboratories 

1  Optimal  Design  of  Finite  Wordlength  FIR  Digital 
Filters  for  an  Analog  Transversal  Filter  with  Tap 
Weight  Circuitry  Defects  Using  Adaptive  Modeling 
210-10MG-123 

2  Automatic  Adaptive  Remeshing  for  Finite  Element 
Reliability  Assessment  of  Electronic  Devices 
210-10MG-129 

3  lonospherically-Induced  Phase  Distortion  Across 
Wide-Aperture  HF  Phased  Arrays 
210-10MG-047 

4  A  Study  of  Interacting  Tunneling  Units  with 
Possible  Application  to  High  Temperature 
Superconductors 

210-10MG-057 

5  Reduced  Bandwidth  Binary  Phase-Only  Filters 
210-10MG-052 

6  Computer  Modeling  of  GaAs/AlGaAs  MQW  Devices 
for  Optical  Properties 

210-10MG-107 

7  Fiber  Optic  Distribution  System  for  Phased  Array 
Antennas 

760-7MG-113 

8  Continuation  Study  of  a  Communications 
Receiver  for  Spread  Spectrum  Signals 
210-10MG-067 

9  Development  of  a  System  to  Deposit  Thin  Films  of 
Titanium  Carbide  Using  Atomic  Layer  Epitaxy 
219-9MG-113 


Professor 

Dr.  Charles  Alajajian 

Dr.  Ian  Grosse 

Dr.  Henry  Helmken 

Dr.  Michael  Klein 

Dr.  William  Kuriger 
Dr.  Khaja  Subhani 

Dr.  David  Sumberg 
(1987) 

Dr.  Donald  Ucci 

Dr.  Kenneth  Walter 
(1988) 
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10 


Neural  Networks  for  Invariant  Pattern  Recognition  Dr.  James  Wolper 
210-10MG-061 


Arnold  Engineering  Development  Center 


11 

The  Performance  of  IR  Detectors  Illuminated 
by  Monochromatic  Radiation 

210-10MG-029 

Dr.  Brian  Beecken 

12 

Sodium  Fluorescence  Studies  for  Application  to 

RDV  of  Hypersonic  Flows 

210-10MG-076 

Dr.  Stephen  Cobb 

13 

Report  Not  Publishable  At  This  Time 

210-10MG-086 

Dr.  John  Francis 

14 

NOT  PUBLISHABLE  AT  THIS  TIME 
210-10MG-134 

Dr.  Orlando  Hankins 

15 

An  Experimental  Approach  for  the  Design  of  a 

Mixer  for  an  Arc  Heater 

210-10MG-027 

Dr.  Lang-Wah  Lee 

16 

No  Report  Submitted  (1986) 

760-6MG-099 

Dr.  Arthur  Mason 

17 

Laser-Induced  Fluorescence  of  Nitric  Oxide 
210-10MG-054 

Dr.  Chun  Fu  Su 

18 

Spectroscopic  Monitoring  of  Exhaust  Gases 
210-10MG-099 

Dr.  Richard  Tipping 

19 
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Optimal  Design  Of  Finite  VVordlength  FIR  Digital  Filters 
For  an  Analog  Transversal  Filter  With  Tap  Weight 
Circuitry  Defects  Using  Adaptive  Modeling 

by 

Charles  J.  Alajajian 

ABSTRACT 

An  algorithm  is  presented  for  designing  optimal  finite  wordlength  FIR  digital  filters  for  a 
programmable  analog  transversal  filter  with  known  tap  weight  circuitry  defects.  The  technique 
is  an  application  of  an  adaptive  modeling  scheme  formerly  used  by  Widrow  and  Stearns  for  the 
synthesis  of  infinite- precision  FIR  digital  filters  from  specified  frequency  response  characteristics. 

Unlike  many  existing  algorithms  which  utilize  successive  rounding  of  the  infinite-precision 
coefficients,  the  proposed  adaptive  modeling  scheme  for  the  optimal  (Chebyshev)  design  of  finite 
wordlength  FIR  digital  filters  incorporates  the  finite  wordlength  restriction  as  part  of  the  filter 
design  procedure.  The  frequency  response  characteristics  in  the  fundamental  frequency  range  are 
specified  by  taking  the  DFT  of  the  optimal  infinite-precision  filter  coefficients;  these  coefficients 
are  obtained  via  the  Parks-McCIellan  algorithm. 

A  standard  system  identification  architecture  is  used  in  which  the  optimal  infinite-precision  filter 
represents  the  plant,  which  is  known  in  this  scheme.  The  plant  and  the  adaptive  filter  are 
simultaneously  driven  by  an  input  signal  consisting  of  a  sum  of  sinuosoids.  Using  the  truncated, 
optimal  infinite-precision  coefficients  as  the  initial  tap  weight  vector,  the  LMS  algorithm,  which 
is  implemented  digitally,  adjusts  the  tap  weights  from  among  a  finite  set  of  fixed-point  numbers, 
in  an  attempt  to  minimize  the  mean  square  error  in  the  frequency  response. 

The  algorithm  is  conceptually  simple,  requires  very  little  computational  effort,  and  is  effective 
even  for  relatively  long  filter  lengths. 
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I.  INTRODUCTION 

The  programmable  analog  transversal  filter  (PTF)  prototype  used  in  this  work  is  based  upon  an 
acoustic  charge  transport  (ACT)  tapped  delay  line  with  active  GaAs  circuits  for  coefficient 
storage  and  the  tap  weight  circuitry  [1].  When  a  digital  filter  is  implemented  on  this  special- 
purpose  hardware,  the  infinite-precision  filter  coefficients  are  represented  internally  by  the  ACT 
PTF  in  digital  memory  by  a  6  — bit,  signed  magnitude,  tap  weight  word  [1].  "inis  internal 
quantization  results  in  a  departure  from  the  ideal  frequency  response.  Coefficient-quantization 
errors  in  FIR  digital  filters  have  been  thoroughly  studied  by  many  authors,  among  them  [5], 
[15]. 

In  the  manufacture  of  ACT  PTF’s,  the  multiple  tap  weight  circuits  may  exhibit  some  error  in 
representing  the  tap  weights,  primarily  due  to  certain  parasitic  and  nonideal  effects  unavoidable 
in  a  practical  weighting  circuit.  However,  by  careful  modeling,  these  effects  may  be 
compensated  for  [1]. 

Of  a  more  severe  nature  are  defects  in  the  tap  weight  circuitry  which  preclude  the  ability  to 
completely  address,  or  alter  the  tap  weight  values.  That  is,  one  or  more  bits  may  be  fixed  at 
some  prescribed  level  —  fully  on,  fully  off,  or  at  some  level  in  between  [18].  These  tap  weight 
circuitry  defects  manifest  themselves  as  errors  in  the  filter’s  frequency  response. 

An  adaptive  modeling  scheme  formerly  used  by  Widrow  and  Stearns  [12],  [13]  for  the  synthesis 
of  FIR  digital  filters  is  modified  and  applied  to  the  design  of  optimal  finite  wordlength  FIR 
digital  filters  for  an  ACT  programmable  transversal  filter  with  these  tap  weight  circuitry  defects. 
The  modified  scheme  uses  a  digital  implementation  of  the  LMS  algorithm  which  attempts  to 
find  a  set  of  tap  weights  from  among  a  finite  set  of  fixed-point  numbers,  which  minimizes  the 
mean-squared  error  in  the  frequency  response. 

My  research  interests  have  been  in  the  areas  of  computer-aided  design  of  electronic  circuits  and 
in  digital  signal  processing;  this  provides  a  wide  array  of  tools  with  which  to  address  this 
problem. 

II.  OBJECTIVES 

The  primary  objective  of  this  research  effort  is  to  develop  an  algorithm  to  design  optimal  finite 
wordlength  FIR  digital  filters  for  a  programmable  transversal  filter,  based  upon  ACT 
technology,  with  tap  weight  circuitry  defects.  Computer  simulation  will  be  used  to  assess  the 
effectiveness  of  the  algorithm  and  to  simulate  the  tap  weight  circuitry  defects  of  the  actual  ACT 
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PTF  hardware. 


Presently,  a  prototype  analog  PTF,  built  primarily  for  demonstration  purposes  [l]  consists  of  an 
analog  tapped-delay  line  with  64  taps  which  utilizes  digital  memory  to  store  the  binary 
representation  of  each  of  the  desired  tap  weights;  in  [1]  this  is  termed  a  digital/analog  (D/A) 
PTF.  The  primary  components  are  an  analog  tapped-delay  line  and  digital  reference  storage 
registers  and  weighting  circuits.  The  analog  tapped-delay  line  processes  the  input  signal;  the 
digital  storage  registers  record  the  tap  weight  words;  the  weighting  circuits,  consisting  of 
multiplying  D/A  converters  (MDAC)  weight  the  analog  signals  from  the  taps  according  to  the 
digital  tap  weight  words.  The  prototype  D/A  PTF  referred  to  in  this  paper  consists  of  a  64  — tap 
ACT  delay  line  with  a  SAW  frequency  of  360  Mhz,  64  MDAC  circuits  which  utilize  signed 
magnitude  word  format  for  the  tap  weight.*?,  and  a  64X6  static  random-access  tap  weight  store, 
with  an  address  decoder.  The  tap  weight  word  length  is  six  bits,  including  the  sign  bit,  so  that 
the  smallest  number  variation  that  can  be  represented  corresponds  to  that  is,  one  LSB  of  the 
tap  weight  range  [1].  This  internal  quantization  of  the  filter  coefficients,  introduces  movement  in 
the  nominal  position  of  the  zeros  of  the  FIR  filter,  which  results  in  errors  in  the  frequency 
response. 

Nonuniformitics  in  the  active  devices  result  in  nonuniformites  in  the  tap  weights,  which  ate 
manifest  as  errors  ir,  the  frequency  response,  although  ostensibly,  the  MDAC  weighting  circuits 
eliminate  these  errors  to  first  order.  In  practice,  the  MDAC  circuits  introduce  errors  in  the 
actual  weights  due  to  parasitic  capacitances  in  the  C/2C  ladder  circuits,  parasitics  in  the  active 
switches,  and  the  finite  impedances  of  significant  summing  buses.  Identification  and 
compensation  for  these  errors  are  feasible  if  the  MDAC  performance  is  carefully  modeled  [1]. 

In  addition,  limitations  in  tap  weight  uniformity  is  due  to  variations  in  component  parameters 
among  ail  the  MDAC  circuits,  and  to  ACT  delay  line  nonuniformities.  According  to  [1],  the 
latter  can  be  compensated  for,  if  a  memory-mapping  algorithm  is  used  with  a  small  reduction  in 
the  overall  tap  programming  range. 

III.  DESIGN  OF  FINITE  WORDLENCTH  FIR  DIGITAL  FILTERS 

a.  An  adaptive  modeling  scheme  is  first  utilized  to  design  optimal  finite  wordlength  FIR  digital 
filters  assuming  no  tap  weight  circuitry  defects.  The  problem  is  to  design  an  FIR  filter,  whose 
coefficients  are  quantized  to  a  specified  finite  word  length,  which  best  approximates  the 
frequency  response  of  an  ideal  filter  whose  coefficients  have  infinite  precison.  Other  methods 
have  been  proposed  [2]  — [6],  (19),  but,  with  the  exception  of  (2),  the  methods  are  effective  only 
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for  filter  lengths  of  ten  or  less. 


The  method  proposed  in  [2]  yields  the  optimal  (in  the  Chebyshev,  or  min-max  sense)  finite 
wordlength  FIR  filter  coefficients  using  mixed-integer  programming  methods  to  solve  the 
discrete  optimization  problem.  Besides  its  complexity,  the  method  is  very  costly  in  terms  of  the 
amoun'.  f  computer  time  required  (2].  However,  it  is  reported  to  be  the  only  general  way  of 
obtaining  the  optimal  finite- wordlength  coefficients  [2].  The  idea  behind  many  of  these  methods 
is  to  obtain  the  optimal  finite- precision  coefficients  from  an  iterative  process  in  which  the 
infinite-precision  coefficients  are  successively  rounded  (2],  [19];  the  finite-wordlength  restriction  is 
not  incorporated  into  the  filter  design  procedure  and  thus  the  solution  obtained  may  be 
suboptimal  [2].  However,  the  proposed  adaptive  modeling  scheme  for  the  optimal  design  of 
finite-wordlength  FIR  digital  filters  incorporates  the  finite  wordlength  restricition  as  part  of  the 
filter  design  procedure.  Although  optimality  of  the  coefficients  cannot  be  proved  because  the 
finite-precision  implementation  of  the  LMS  algorithm  may  deviate  from  the  infinite-precision 
performance,  the  method  offers  noticeable  improvement  over  the  truncated  infinite-precision 
coefficients. 

This  problem  of  finding  a  suitable  FIR  filter  with  quantized  weights  may  be  modeled  as  an 
adaptive  identification  problem  and  the  same  basic  architecture  may  be  used  [12|  — (14|.  The 
architecture,  depicted  in  Figure  1,  consists  of  an  adaptive  filter  and  a  plant  which  are 
simultaneously  driven  by  the  same  input  signal  x(k).  The  output  of  the  plant,  d(k),  supplies  the 
ideal  or  desired  response  for  the  adaptive  filter;  the  subscript  k  denotes  the  discrete-time  sample 
index.  In  the  identification  problem,  the  plant  is  usually  unknown  and  the  adaptive  filter  is 
used  to  find  a  linear  model  which  represents  the  best  fit  to  the  unknown  plant;  in  the  case  of  a 
dynamic  plant  the  tap  weights  of  the  model  are  time  varying  [14]. 

In  the  present  problem,  the  plant  is  a  known  entity;  it  is  the  ideal  filter  whose  coefficients  have 
infinite-precision.  These  coefficients,  which  constitute  the  filter’s  unit-sample  response,  may  be 
obtained  using  any  FIR  filter  design  technique  and  are  assumed  to  be  exact  to  within  computer 
accuracy.  However,  because  optimal  (Chebyshev)  design  is  the  focus  of  this  research,  the  Parks- 
.VlcClellan  algorithm  is  exclusively  used  [9],  [16].  The  plant  is  not  dynamic  since  the  filter 
coefficients  are  not  time-varying.  The  adaptive  filter  is  a  finite  wordlength  FIR  filter  which 
attempts  to  adjust  its  coefficients  to  closely  approximate  the  ideal  filter  response.  Because  the 
coefficients  are  quantized,  however,  the  rule  for  the  updating  the  adaptive  filter  weights  is 
implemented  digitally;  that  is,  the  internal  algorithmic  calculations  are  quantized  to  a  finite 
precision  [7],  [14].  Thus,  the  scheme  attempts  to  find  the  finite- precision  model  which  represents 
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the  best  fit  to  the  infinite-precision  filter.  While  in  most  finite-precision  realizations  the  input 
samples  are  also  quantized  to  a  finite  precision,  in  the  present  scheme  only  the  adaptive  filter 
weights  are  quantized;  the  samples  of  the  input  signal  x(k),  the  ideal,  or  desired  response  d(k) 
and  the  adaptive  filter  output  y(k)  are  all  assumed  to  be  infinite- precision  quantities  as  implied 
by  Figure  2. 

The  basic  architecture  of  Figure  1  has  been  successfully  employed  by  Widrow  and  Stearns  to 
synthesize  infinite-precision  FIR  filters  from  a  set  of  design  specifications  [12],  [13].  There  are 
some  pronounced  differences  between  the  infinite-precision  synthesis  scheme  of  Widrow  and 
Stearns  and  the  finite-wordlength  design  proposed  here.  In  the  former  scheme,  the  plant,  which 
is  designated  as  a  “pseudo  filter,”  is  generally  not  physically  realizable;  its  unit-sample  response 
is  not  known;  only  the  output  response  of  the  pseudo  filter  to  a  sinusoidal  input  is  known.  This 
is  to  be  contrasted  with  the  proposed  algorithm  for  finite-wordlength  design  in  which  the  plant 
is  a  filter  whose  unit-sample  response  (which  is  exact  to  computer  accuracy)  is  specified;  thus, 
the  output  of  this  ideal  filter  to  a  given  sinusoidal  input  is  also  known. 

A  more  salient  difference  between  the  two  schemes  is  that  the  internal  calculations  are  assumed 
to  be  of  infinite-precision  in  the  former  scheme;  that  is  an  analog  model  is  assumed;  in  the  latter 
scheme,  the  adaptive  algorithm  is  implemented  digitally. 

The  finite-precision  form  of  the  LMS  algorithm,  as  it  applies  to  the  design  of  finite  wordlength 
FIR  filters  is  now  presented  [7],  [14],  Referring  to  Figure  2,  the  Ath  output  sample  of  the 
adaptive  filter,  y(k),  may  be  computed  from 

y(k)=w^(k)x(k)  (1) 

where  the  character  denotes  the  transpose  operator  and  Wq(k)  is  the  tap  weight  vector,  an 
N— vector  of  time-varying  weights;  the  subscript  q  indicates  number  quantization.  That  is, 
Wq(k)  =  [wjq(k),W2q(k),. .  .,Wjyjjj(k)]^.  The  tap-input  vector  x(k),  is  an  N-vector  consisting  of 
the  last  N  samples  of  the  input  signal,  given  by  x(k)  =  [x(k),x(k  — 1),  ...  ,x(k  — N-f  l)]^ 

The  adaptive  filter  employs  the  LMS  algorithm,  which  attempts  to  ^w^just  the  weights  such  that 
the  output  of  the  adaptive  filter  y(k),  and  the  desired  signal  d(k)  are  equal.  The  error  incurred 
in  this  process  for  the  Fth  sample  is 


c(k)=d(k)-y(k) 
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The  finite-precision,  least-mean  square  (LMS)  algorithm  is  thus  summarized  by  equation  (3)  and 
equation  (4)  below  [7],  [14]. 


r(k)=d(k)-Wqx(k)  (3) 

Wq(k-t-l)=Wq(k)-HQ|/ie(k)x(k)|  (4) 

In  equation  (4),  Q{  }  is  an  operator  denoting  an  ideal  quantizer  and  p  is  a  scalar  quantity 
sometimes  referred  to  as  the  adaptation  constant.  Only  the  product  representing  the  gradient 
vector  estimate  /i£(k)x(k)  is  quantized  before  addition  to  the  tap  weight  ciccumulator  to  form 
the  current  tap  weight;  the  starting  weight  values  in  equation  (4)  are  the  truncated  coefficients. 
This  is  a  reasonable  choice,  since  the  optimal  finite  wordlength  coefficients  are  not  expected  to 
deviate  very  much  from  the  truncated  or  rounded  coefficients  [2]. 

Normally,  in  a  digital  implementation,  the  input  signal  x(k),  the  desired  signal  d(k),  and  the 
internal  algorithm  are  all  quantized  to  a  limited  precision  [14].  However,  in  this  particular 
application  only  the  internal  algorithm  for  updating  the  weight  vector  is  quantized,  as  indicated 
by  equation  (4). 

The  LMS  algorithm  is  known  to  be  an  unstable  algorithm  when  implemented  digitally  [7].  That 
is,  the  algorithm  will  not  limit  the  maximum  deviation  from  infinite-precision  performance  to 
within  finite  bounds,  potentially  resulting  in  an  overflow.  Fortunately,  numerical  stability  is 
usually  not  an  important  property  when  an  adaptive  filter  is  used  to  determine  an  unknown 
setting  and  then  the  weights  are  held  fixed  at  that  setting  [7].  Since  the  FIR  filter  is  not  a  time- 
varying  system,  use  of  the  finite-precision  LMS  algorithm  is  not  precluded. 

In  the  finite-precision  implementation,  the  adaptation  constant  must  be  carefully  chosen  in 
order  to  minimize  the  deviation  between  the  finite-precision  and  infinite-precision  performance. 
Several  authors  have  studied  the  choice  of  the  adaptation  constant;  it  is  shown  in  [7]  — [8],  that 
increasing  the  adaptation  constant  /i  minimizes  the  deviation  from  infinite-precision  performance 
although  decreasing  /i  below  a  certain  value  actually  increases  the  mean-square  prediction  error. 
This  behavior  is  different  from  the  in  finite- precision  case  where  decreasing  /i  is  known  to 
improve  performance  when  there  is  no  change  or  a  slowly  varying  relationship  between  x(k)  and 
d(k).  Compounding  the  problem  of  the  choice  of  fi  in  the  finite-precision  case  is  the  conflicting 
fact  that  increasing  /i  can  also  magnify  numerical  errors.  Thus,  there  is  some  tradeoff  to  be 
made  in  the  choice  of  p,  which  generally  involves  some  measure  of  ‘cut  and  try”  [7]. 
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b.  The  adaptive  filter  and  the  ideal  filter  are  simultaneously  excited  by  a  sum  of  M  sinusoids. 
[12]  — [13],  whose  radian  frequencies  u)^T=2x-fj/f5  (i=l,...,.Vf)  consist  of  uniformly  spaced  points 
around  the  unit  circle  between  [0,  jt]  radians.  While  it  is  not  necessary  that  the  points  be 
uniformly  spaced,  this  is  a  convenient  choice  which  works  well  in  the  present  scheme.  The 
sampling  period  T  is  normalized  to  unity  for  convenience;  thus  the  Nyquist  frequency  is  tr  r/s, 
or  0.5  hz.  The  ith  sample  of  the  input  signal  x(k),  is  given  by 


x(k)=  E-  ,sin2ir/jjb  (5) 

(=1 

Increasing  the  amplitude  c,  of  the  lih  sinusoid  has  the  effect  of  more  tightly  holding  the  desired 
response  at  the  (th  frequency  which  might  be  desirable  if  unsatisfactory  results  are  obtained  [12]. 
In  the  present  application,  setting  all  of  the  c,  equal  so  that  each  sinusoid  has  equal  amplitude  is 
usually  satisfactory. 

The  signal  x(k)  must  be  properly  scaled  so  that  the  quantizer  input,  i.  e.  the  quantity  in 
brackets  in  equation  (4),  is  confined  to  the  interval  [  —  1,  +1].  Since  x(k)  is  a  deterministic 
(sinusoidal)  signal,  ensuring  that  the  range  of  x(k)  is  confined  to  this  interval  is  easily 
accomplished.  Initially  all  the  c,  are  set  according  to  the  weighting  desired  (in  this  case  unity), 
and  the  resulting  range  of  x(k)  is  observed.  If  the  observed  range  of  x(k)  is  confined  to  the 
interval  [— R,  R],  x(k)  is  properly  scaled  by  dividing  all  the  «,  by  R. 

The  ideal  filter  response  d(k),  at  time  k,  contains  the  same  terms  as  the  input  signal,  but  each 
individual  sinusoid  is  scaled  in  amplitude  and  shifted  in  phase  by  amounts  a,  and  Oj, 
resjjectively,  which  are  the  amplitude  and  phase  of  the  infinite-precision  filter  measured  at  the 
itk  frequency.  Therefore.  d(k)  is  given  by 


M 

d{k)='^c,a,stn(2nf,k+e,)  (6) 

(=l 


If  there  is  to  be  no  error  due  to  overflow,  the  range  of  d(k)  must  also  be  confined  to  the  interval 
[— 1,-1-1];  this  is  a  common  assumption  in  fixed  — point  analysis  [8].  This  is  not  a  problem 
because  the  digital  filter  obtained  via  the  Parks- McClellan  algorithm  is  nonamplifying;  that  is, 
the  output  power  is  less  than  or  equal  to  the  input  power  and  |h(n)|<l  for  all  n  [2]. 
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The  triplet  (f,,  a„  ^,)  is  seen  to  characterize  the  ideal  response.  This  triplet  is  obtained  as 
follows. 

(1)  An  FIR  filter  of  length-N  is  designed  using  the  Parks-McClellan  algorithm  [9],  [16],  to  give 
the  infinite-precision  unit-sample  response  h(n)  of  length-N.  Since  optimal  Chebyshev  design  is 
the  focus  of  this  research,  the  Parks- McClellan  algorithm  is  used.  However,  any  of  the  many 
other  available  design  techniques  [10],  [16],  such  as  frequency-sampling  design,  least-squared 
error  frequency-domain  design,  or  window-based  designs  could  be  used  instead. 

(2)  An  L-point  FFT  of  h(n)  is  taken  where  L>N;  the  relationship  between  L  and  N  will  soon 
become  evident.  It  is  well  known  that  the  FFT  is  simply  an  efficient  method  for  computing  the 
DFT.  The  L-point  DFT  may  be  interpreted  as  samples  of  the  frequency  response 
evaluated  at  L  uniformly  spaced  frequency  points  w  on  the  unit  circle,  ranging  from  zero  to  2t, 
or  /  ranging  from  zero  to  unity  [16].  That  is. 

H(/)=H(e-’‘"') 

Clearly,  from  equations  (5)  and  (6),  only  A/  frequencies  in  the  fundamental  range  [0,  0.5]  are  of 
interest.  Thus,  if  L  is  even,  only  the  first  A/=^-|-l  complex  DFT  samples  of  the  L-point  DFT 
are  used;  the  last  sample  corresponds  to  the  Nyquist  frequency.  If  L  is  odd,  only  the  first  M 
=  ^  ^  DFT  samples  are  used;  note  that  the  last  sample  does  not  correspond  to  the  Nyquist 
frequency.  Since  the  sampling  frequency  is  normalized  to  unity,  the  sample  (bin)  number  /  and 
the  actual  frequency  are  related  by 

f,=  ^  l</<  M  (8) 

where  A/=  ^  +  1  (L  even)  and  A/=  (L  odd)  and  the  bins  are  numbered  starting  with 

unity.  The  amplitude  and  phase  of  H(/)  are  specified  at  these  A/ frequencies  according  to 


a,=  |H(01 

l</<  M 

(9) 

6,=arg  H(0 

l</<  A/ 

(10) 

If  Af=,V  frequency  samples  are  used,  which  require  an  L=(2N— 2)-point  DFT  (.V  even)  or  an 
L  =  (2jV— l)-point  DFT  {N  odd),  the  finite- precision  LMS  algorithm  is  observed  to  render 
unsatisfactory  results.  Ostensibly,  a  larger  number  of  frequency  samples  is  needed  to 
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satisfactorily  describe  the  magnitude  and  phase  characteristics  of  the  ideal  filter.  By  increasing 
the  number  of  frequency  samples  to  A/=2.V.  the  algorithm  consistently  yields  very  good  results. 
This  requires  an  i  =  (4Af— 2)-point  DFT  (.Veven)  or  an  £  =  (4Af— l)-point  DFT  (.Vodd).  Thus, 
the  unit-sample  response  of  the  ideal,  infinite-precision  filter  of  length  N  is  padded  with  3iV— 2 
zeros  (iV  even)  or  3iV— 1  zeros  {N  odd)  prior  to  taking  the  DFT,  yielding  2N  equally  spaced 
frequency  samples  between  [0,0.5].  Because  of  the  particular  FFT  algorithm  used  to  compute 
the  DFT,  the  filter  length  N  is  not  restricted  to  be  a  power  of  2. 

The  Parks-McClellan  algorithm  may  force  the  ideal  filter  response  to  zero  at  either  f=0.0  or 
f=0.5  [16j.  In  the  latter  case,  it  may  be  desirable  to  utilize  the  odd,  i  =  (4^— l)-point  DFT; 
otherwise,  there  will  automatically  be  one  less  sinusoidal  term  in  equation  (6). 

IV.  FUNCTION  AND  OPERATION  OF  THE  QUANTIZER 

a.  The  ACT  PTF  cannot  internally  represent  the  filter  coefficients  with  infinite-precision  or  an 

unlimited  number  of  bits.  To  model  the  quantization  operation  on  the  computer,  each  infinite- 

precision  coefficient  value  (represented  by  a  floating-point  number  on  the  computer)  is  converted 

into  a  second,  floating-point  number  equal  to  the  finite-precision  value  closest  to  it,  that  can  be 

represented  in  a  signed  magnitude,  fixed-point  number  system  having  A— 1  bits  (excluding  the 

sign  bit).  The  finite-precision  values  that  can  be  represented  in  the  number  system  are  called 

quantization  levels;  the  quantization  step  Q,  is  the  distance  between  two  adjacent  quantization 
—  (6  —  1) 

levels;  Q=2  corresponds  to  a  binary  “1”  at  the  least  significant  register  position  [11], 

[15],  [17],  [21]. 

The  ACT  PTF  represents  the  tap  weight  values  using  a  signed  magnitude,  fixed-point  number 
representation  [1].  The  optimum  infinite-precision  coefficients  are  assumed  to  be  truncated  to  a 
wordlength  of  5  — bits  (b=6);  this  determines  the  quantizer  characteristic  used  in  the  computer 
simulation.  Note  the  magnitude  of  the  error  after  truncation  is  generally  more  severe  than  after 
rounding;  in  the  former  case  it  is  confined  to  the  interval  [— Q,  Q]  while  in  the  latter  case  it  is 
confined  to  the  interval  [— [15]  ,  [17]. 

V.  MODIFICATION  OF  THE  LMS  ALGORITHM  TO  EN.SURE  LINEAR  PHA.SE 

The  L,MS  algorithm  may  not  always  yield  filters  with  linear  phase  even  if  the  frequency  response 
of  the  optimal,  infinite-precision  filter  possesses  the  linear  phase  property.  However,  the 
algorithm  may  be  modified  so  that  the  coefficients  are  adjusted  symmetrically,  thus  ensuring 
filters  with  linear  phase,  as  follows  [12],  [22]. 
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Let  I  =(^],  denote  the  number  of  symmetric  weight  pairs,  where  N  is  the  filter  length,  and  [x] 
denotes  the  largest  integer  less  than  or  equal  to  x.  For  an  odd  length  filter,  the  middle  weight  is 
updated  according  to 

)q  =  j  +  pe(  k)x(k  - 1)  (11) 

while  the  lower  weights  are  updated  according  to 


+  0  +  x(k-I-0}  /=l,2 . 1  (12) 


then  the  upper  weights  are  set  equal  to  w 


k+1  _  k+1 

(I+/+l)q-'^(I-/+l)q 


/=1.2, 


Similarly,  for  an  even  length  filter,  the  lower  weights  are  updated  according  to 


0+x(k-l-/+l)}  1=1.2 . 1  (13) 


then  the  upper  weights  are  set  equal  to  w 


k+1  _  k+1 
(/+I)q-'^(I-/+l)q 


/=1.2,. 


The  components  of  the  finite-precision  tap  weight  vector  are  designated  by  the  q  subscript,  while 
the  sample  index  k  is  indicated  by  a  superscript  to  simplify  the  otherwise  cumbersome  notation. 


VI.  TESTING  OF  THE  FINITE  WORDLENGTH  FIR  FILTER  DESIGN  ALGORITHM 

a.  Six  filters  of  various  lengths  and  types  were  designed  using  the  Parks  — McClellan  algorithm 
[9],  [16],  to  test  the  effectiveness  of  finite-wordlength  FIR  filter  design  based  on  adaptive 
modeling  without  tap  weight  circuitry  defects.  Using  the  truncated  coefficients  as  the  starting 
weights,  the  corresfionding  finite-wordlength  FIR  filters  were  designed  using  the  proposed 
adaptive  modeling  algorithm.  The  optimal  infinite-precision  coefficients,  the  truncated 
coefficients  and  the  optimal  finite-wordlength  coefficients  obtained  using  the  proposed  algorithm 
are  recorded  for  each  test  filter,  and  the  magnitude  responses  are  compared. 

b.  A  length-21  Low-pass  Filter 


FINITE  IMPULSE  RESPONSE  (FIR) 
LINEAR  PHASE  DIGITAL  FILTER  DESIGN 
REMES  EXCHANGE  ALGORITHM 
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FILTER  LENGTH  =  21 
UNIT-SAMPLE  RESPONSE  ♦***♦ 
h(  1)=  .18255444E-01  =  h(  21) 

h(  2)  =  .55136724E-01  =  h(  20) 
h(  3)  =  -.40910738E-01  =  h(  19) 
h(  4)  =  .14930869E-01  =  h(  18) 
h(  5)  =  .27568590E-01  =  h{  17) 
h(  6)  =  -.59407810E-01  =  h(  16) 
h(  7)  =  .44841838E-01  =  h(  15) 
h(  8)  =  .31902670E-01  =  h(  14) 
h(  9)  =  -.14972545E4-00  =  h(  13) 
h(  10)  =  .25687238E-(-00  =  h(  12) 
h(  11)  =  .69994063E-)-00  =  h(  11) 


Band  1 

Band  2 

Lower  band  edge 

0.0 

0.37 

Upper  band  edge 

0.33 

0.50 

Desired  value 

l.O 

0.0 

Weighting 

1.0 

1.0 

Deviation 

0.0988697 

0.0988697 

Deviation  in  dB 

0.8189237 

-20.0987384 

The  truncated,  6-bit  coefficients,  multiplied  by  2®=  32  are 


h(  1) 

=  0 

= 

h(21) 

h(  2) 

=  1 

= 

h(20) 

h(  3) 

=  -1 

= 

h(19) 

h(  4) 

=  0 

= 

h(18) 

h(  5) 

=  0 

= 

h(17) 

h(  6) 

=  -1 

= 

h(16) 

h(  7) 

=  1 

= 

h(15) 

h(  8) 

=  1 

= 

h(14) 

h(  9) 

=  -4 

= 

h(13) 

h(10) 

=  8 

= 

h(12) 

h(ll) 

=  22 

= 

h(ll) 
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The  optimal,  6-bit  coefficients,  multiplied  by  2^  =  32  are 


h(  1) 

= 

0 

= 

h(21) 

h(  2) 

= 

2 

= 

h(20) 

h(  3) 

z= 

-1 

= 

h(19) 

h(  4) 

= 

0 

= 

h(18) 

h(  5) 

= 

1 

= 

h(17) 

h(  6) 

= 

-2 

= 

h(16) 

h(  7) 

= 

1 

= 

h(15) 

h(  8) 

= 

1 

= 

h(14) 

h(  9) 

= 

-5 

= 

h(13) 

h(10) 

= 

8 

= 

h(12) 

h(ll) 

= 

22 

= 

h(ll) 

It  is  interesting  to  note  the  proximity  of  the  optimal  coefficients  to  the  truncated  coefficients.  A 
comparison  of  the  magnitude  responses  is  shown  in  Figure  3.  The  LMS  convergence  is  noted  by 
plotting  the  error  e(k)  versus  the  iteration  number;  this  is  shown  in  Figure  4. 

c.  A  length-20  Low-pass  Filter 

FINITE  IMPULSE  RESPONSE  (FIR) 

LINEAR  PHASE  DIGITAL  FILTER  DESIGN 
REMES  EXCHANGE  ALGORITHM 

FILTER  LENGTH  =  20 
UNIT-SAMPLE  RESPONSE  **••* 
h(  1)=  .48411223E-01  =  h(  20) 
h(  2)  =  .13537383E-01  =  h(  19) 
h(  3)  =  -.39344038E-01  =  h(  18) 
h(  4)  =  .53151826E-01  =  h(  17) 
h(  5)  =  -.31608274E-01  =  h(  16) 
h(  6)  =  -.25162720E-01  =  h(  15) 
h(  7)  =  .83330645E-01  =  h(  14) 
h(  8)  =  -.86372217E-01  =  h(  13) 
h(  9)  =  -.34074439E-01  =  h(  12) 
h(  10)  =  .56718866E-I-00  =  h(  11) 


1-14 


Band  1 

Band  2 

Lower  band  edge 

0.0 

0.37 

Upper  band  edge 

0.33 

0.50 

Desired  value 

1.0 

0.0 

Weighting 

1.0 

1.0 

Deviation 

0.0.0981161 

0.0981161 

Deviation  in  dB 

0.8129651 

-20.1651949 

Because  the  frequency  response  of  this  filter  is  forced  to  be  zero  at  /=0.5,  an  odd  (4A^— 1)=  79- 
point  DFT  is  used  for  the  reason  explained  in  section  III. 

The  truncated.  6  — bit  coefficients,  multiplied  by  32,  are 

h(  1)  =  1  =h{20) 

h(  2)  =  0  =h(19) 

h(  3)  =  -1  =h(18) 

h(  4)  =  1  =h(17) 

h(5)  =  -1  =h(16) 

h(  6)  =  0  =h(15) 

h(  7)  =  2  =h(14) 

h(8)  =  -2  =h(13) 

h(9)  =  -1  =h(12) 

h(10)  =  18  =h(Il) 

The  optimal,  6-bit  coefficients,  multiplied  by  2^=  32,  are 


h(  1) 

1 

= 

h(20) 

h(  2) 

= 

0 

= 

h(19) 

h(  3) 

= 

-1 

= 

h(18) 

h(  4) 

= 

2 

= 

h(17) 

h(  5) 

= 

-1 

= 

h(16) 

h(  6) 

= 

-1 

= 

h(15) 

h(  7) 

= 

2 

= 

h(14) 

h(  8) 

= 

-3 

h(13) 

h(  9) 

-1 

= 

h(12) 

h(10) 

= 

18 

= 

Mil) 
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A  comparison  of  the  magnitude  responses  is  shown  in  Figure  5.  A  plot  of  the  error  e(k)  versus 
the  iteration  number  is  shown  in  Figure  6. 

d.  A  Iength-21  Dci...dpass  Filter 


FINITE  IMPULSE  RESPONSE  (FIR) 
LINEAR  PHASE  DIGITAL  FILTER  DESIGN 
REMES  EXCHANGE  ALGORITHM 


FILTER  LENGTH  =  21 
*****  UNIT-SAMPLE  RESPONSE  ♦*•*. 
h(  1)  =  .46678024E-02  =  h(  21) 
h(  2)  =  .96758919E-02  =  h(  20) 
h(  3)  =  -.90181293E-01  =  h(  19) 
h(  4)  =  -.25750540E-01  =  h(  18) 
h(  5)  =  .45590497E-01  =  h(  17) 
h(  6)  =  -.10308870E-01  =  h(  16) 
h(  7)  =  .I1038485E-(-00  =  h(  15) 
h(  8)  =  .12596292E>-01  =  h(  14) 
h(  9)  =  -.28589705E-I-00  =  h(  13) 
h(  10)  =  -.17343520E-01  =  h(  12) 
h(  11)  =  .38577729E-I-00  =  h(  11) 


Band  1 

Band  2 

Band  3 

Lower  band  edge 

0.0 

0.18 

0.37 

Upper  band  e^  ge 

0.14 

0.33 

0.50 

Desired  value 

0.0 

1.0 

0.0 

Weighting 

1.0 

1.0 

1.0 

Deviation 

0.1073546 

0.1073546 

0.1073546 

Deviation  in  dB 

-19.3835875 

0.8857342 

-19.3835875 

The  truncated,  6-bit  coefficients,  multiplied  by  2^=  32  are 


h(  1)  =  0  =  h(21) 
h(  2)  =  0  =  h(20) 
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h(  3) 

= 

.2 

= 

h(19) 

h(  4) 

= 

U 

= 

h(18) 

h(  5) 

= 

1 

= 

h(17) 

h(  6) 

0 

h(16) 

h(  7) 

= 

3 

= 

h(15) 

h(  8) 

= 

0 

= 

h(14) 

h(  9) 

= 

-9 

= 

h(13) 

h{10) 

= 

0 

= 

h(12) 

h(ll) 

= 

12 

= 

h(ll) 

The  optimal,  6-bit  coefficients,  multiplied  by  2^  =  32,  are 


h(  1) 

= 

0 

= 

h(21) 

h(  2) 

= 

1 

= 

h(20) 

h(  3) 

= 

-3 

= 

h(19) 

h(  4) 

= 

-1 

= 

h(18) 

h(  5) 

= 

2 

= 

h(17) 

h(  6) 

= 

0 

= 

h(16) 

h(  7) 

= 

4 

= 

h(15) 

h(  8) 

= 

1 

= 

h(14) 

h(  9) 

= 

-9 

= 

h(13) 

h(10) 

= 

0 

= 

h(12) 

h(ll) 

= 

13 

= 

h(ll) 

A  comparison  of  the  magnitude  responses  is  shown  in  Figure  7.  A  plot  of  the  error  e(k)  versus 
the  iteration  number  is  shown  in  Figure  8. 

e.  A  length-32  (Wide-band)  Bandpass  Filter 


FINITE  IMPULSE  RESPONSE  (FIR) 
LINEAR  PHASE  DIGITAL  FILTER  DESIGN 
REMES  EXCHANGE  ALGORITHM 

FILTER  LENGTH  =  32 
*♦*•*  UNIT-SAMPLE  RESPONSE  *♦••• 
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h(  1)  = 

-.48581465E-02  =  h(  32) 

h(  2)  = 

-.84469004E-02  =  h(  31) 

h(  3)  = 

.72525650E-02  =  h(  30) 

h(  4)  = 

.13274785E-02  =  h(  29) 

h(  5)  = 

.16371012E-01  =  h(  28) 

h(  6)  = 

.10639032E-01  =  h(  27) 

h(  7)  = 

2976335 lE-01  =  h(  26) 

h(  8)  = 

-.31177921E-02  =  h(  25) 

h(  9)  = 

-.36395655E-01  =  h(  24) 

h(  10)  = 

.74851950E-02  =  h(  23) 

h(  11)  = 

.82077622E-01  =  h(  22) 

h(  12)  = 

-.81893674E-02  =  h(  21) 

h(  13)  = 

.73856838E-01  =  h(  20) 

h(  14)  = 

-.11670402E-f-00  =  h(  19) 

h(  15)  = 

-.30804001E-f-00  =  h(  18) 

h(  16)  = 

.31369272E4-00  =  h{  17) 

Band  1 

Band  2 

Band  3 

Lower  band  edge 

0.0 

0.20 

0.4250 

Upper  band  edge 

0.1 

0.35 

0.50 

Desired  value 

0.0 

1.0 

0.0 

Weighting 

1.0 

1.0 

1.0 

Deviation 

0.0056256 

0.0056256 

0.0056256 

Deviation  in  dB 

-44.9966866 

0.0487261 

44.9966866 

The  truncated,  6-bit  coefllcients,  multiplied  by  2®=  32  are 


h(  1) 

z= 

0 

h(32) 

h(  2) 

= 

0 

= 

h(31) 

h(  3) 

z= 

0 

= 

h(30) 

h(4) 

= 

0 

= 

h(29) 

h(  5) 

= 

0 

= 

h(28) 

h(  6) 

= 

0 

= 

h(27) 

h(  7) 

0 

= 

h(26) 

h(  8) 

= 

0 

= 

h(25) 

h(  9) 

= 

-1 

= 

h(24) 

I-I8 


h(10) 

=  0 

=  h(23) 

h(ll) 

—  2 

=  h(22) 

h(12) 

=  0 

=  h(21) 

h(13) 

—  2 

=  h(20) 

h(14) 

=  -3 

=  h(19) 

h(l5) 

=  -9 

=  h(18) 

h(16)= 

=  10 

=  h(17) 

The  optimal,  6-bit  coefficients,  multiplied  by  2^  =  32,  are 


h(  1) 

= 

0 

= 

h(32) 

h(  2) 

= 

0 

= 

h(31) 

h{  3) 

r= 

0 

= 

h(30) 

h(  4) 

= 

0 

= 

h(29) 

h(  5) 

= 

0 

= 

h(28) 

h(  6) 

0 

= 

h(27) 

h(  7) 

= 

0 

= 

h(26) 

h(  8) 

z= 

0 

h(25) 

h(  9) 

= 

-1 

= 

h(24) 

h(10) 

= 

0 

= 

h(23) 

h(ll) 

= 

2 

= 

h(22) 

h(12) 

= 

0 

= 

h(21) 

h(13) 

= 

3 

= 

h(20) 

h(14) 

= 

-4 

= 

h(19) 

h(15) 

= 

-11 

= 

h(18) 

h(16) 

= 

11 

= 

h(17) 

While  the  algorithm  yields  some  improvement  as  seen  from  Figure  9,  better  results  might  be 
anticipated  if  the  10  truncated  coefficients  with  value  zero  were  adjusted  to  a  non-zero  value  by 
the  adaptive  algorithm.  A  comparison  of  the  magnitude  responses  is  shown  in  Figure  9.  A  plot 
of  the  error  e(k)  versus  the  iteration  number  is  shown  in  Figure  10. 

f.  A  length-64  (Narrow-band)  Bandpass  Filter 

FINITE  IMPULSE  RESPONSE  (FIR) 

LINEAR  PHASE  DIGITAL  FILTER  DESIGN 
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REMES  EXCHANGE  ALGORITHM 

FILTER  LENGTH  =  64 
***•,  UNIT-SAMPLE  RESPONSE 
h(  1)  =  -.63719952E-02  =  h(  64) 
h(  2)  =  .12366605E-0I  =  h(  63) 
h(  3)  =  .90891666E-02  =  h(  62) 
h(  4)  =  -.52737236E-02  =  h(  61) 
h(  5)  =  -.65397498E-02  =  h(  60) 
h(  6)  =  .73926853E-02  =  h(  59) 
h(  7)  =  .72776156E-02  =  h(  58) 
h(  8)  =  -.66523440E-02  =  h(  57) 
h(  9)  =  -.60031978E-02  =  h(  56) 
h(  10)  =  .48160733E-02  =  h(  55) 
h(ll)=  .32913051E-02  =  h(  54) 
h(  12)  =  -.12530101E-02  =  h(  53) 
h(  13)  =  .11994049E-02  =  h(  52) 
h(  14)  =  -.40770610E-02  =  h(  51) 
h(  15)  =  -.74582934E-02  =  h(  50) 
h(  16)  =  .11142890E-01  =  h(  49) 
h(  17)  =  .15321170E-01  =  h(  48) 
h(  18)  =  -.19652596E-01  =  h(  47) 
h(  19)  =  -.24457549E-01  =  h(  46) 
h(  20)  =  .29206388E-01  =  h(  45) 
h(  21)  =  .34321159E-01  =  h(  44) 
h(  22)  =  39200 157E-01  =  h(  43) 

h{  23)  =  -.44286929E-01  =  h(  42) 
h(  24)  =  .48936060E-01  =  h(  41) 
h(  25)  =  .53609453E-01  =  h(  40) 
h(  26)  =  -.57691 106E-01  =  h(  39) 

)i(  27)  =  -.61579415E-01  =  h(  38) 
h(  28)  =  .64760943E-01  =  h(  37) 
h(  29)  =  .67547088E-01  =  h(  36) 
h(  30)  =  -.69580477E-01  =  li(  35) 
h(  31)  =  -.71032365E-01  =  h(  34) 
h(  32)  =  .71727105E-01  =  h(  33) 
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Band  1 

Band  2 

Band  3 

Lower  band  edge 

0.0 

0.2375 

0.2875 

Upper  band  edge 

0.2125 

0.2625 

0.50 

Desired  value 

0.0 

1.0 

0.0 

Weighting 

I.O 

I.O 

I.O 

Deviation 

0.0217903 

0.0217903 

0.0217903 

Deviation  ip  dB 

-33.2347400 

0.1872354 

-33.2347400 

Tile  truncated,  6-bit  coefficients,  multiplied  by  2^=  32  are 


h(  1) 

= 

0 

= 

h(64) 

h(  2) 

= 

0 

= 

h(63) 

h(  3) 

= 

0 

= 

h(62) 

h(  4) 

= 

0 

= 

h(61) 

h(  5) 

= 

0 

= 

h(60) 

h(  6) 

= 

0 

= 

h(59) 

H  7) 

0 

= 

h(58) 

h(  8) 

= 

0 

= 

h(57) 

h(  9) 

= 

0 

= 

h(56) 

h(10) 

= 

0 

= 

h(55) 

h(ll) 

= 

0 

= 

h(54) 

h(12) 

= 

0 

= 

h(53) 

h(13) 

= 

0 

= 

b(52) 

h{14) 

= 

0 

= 

h(51) 

h(15) 

= 

0 

= 

h(50) 

h(16) 

= 

0 

= 

h(49) 

h(17) 

= 

0 

= 

h(48) 

h(18) 

= 

0 

= 

h(47) 

h(19) 

= 

0 

= 

h(46) 

h(20) 

= 

0 

= 

h(45) 

h(21) 

= 

1 

= 

h(44) 

h(22) 

= 

-1 

= 

h(43) 

h(23) 

= 

-1 

= 

h(42) 

h(24) 

= 

1 

= 

h(41) 

h(25) 

= 

1 

= 

h(40) 

h(26) 

= 

-1 

h(39) 
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h(27)  =  -1  =  h(38) 

h(28)  =  2  =  h(37) 

h(29)  =  2  =  h(36) 

h(30)  =  -2  =  h(35) 

h(31)  =  -2  =  h(34) 

h(32)  =  2  =  h{33) 


The  optimal,  6-bit  coefficients,  multiplied  by  2^  =  32,  are 


h(  1)  =  0  =  h(64) 

h(  2)  =  0  =  h{63) 

h(  3)  =  0  =  h(62) 

h(  4)  =  0  =  h(61) 

5)  =  0  =  h(60) 

h(  6)  =  0  =  h(59) 

h(  7)  =  0  =  h(58) 

h(  8)  =  0  =  h(57) 

a(  9)  =  0  =  h(56) 

li(lO)  =  0  =  h(55) 

h(ll)  =  0  =  h(54) 

h(12)  =  0  =  h(53) 

h(13)  =  0  =  h(52) 

h(l4)  =  0  =  h(51) 

h(15)  =  0  =  h(50) 

h(16)  =  1  =  h(49) 

h(17)  =  1  =  h(48) 

h(18)  =  -1  =  h{47) 

h(l9)  =  -I  =  h(46) 

h(20)  =  1  =  h(45) 

h(21)  =  1  =  h(44) 

h(22)  =  -1  =  h(43) 

h(23)  =  -1  =  h(42) 

h(24)  =  2  =  h(41) 

h(25)  =  2  =  h(40) 

h(26)  =  -2  =  li(39) 

h(27)  =  -2  =  h{38) 
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•> 


h(28) 

h(29) 

h(30) 

h(31) 

h(32) 


2 

2 

‘7 


li(37) 

h(36) 

h(35) 

h(34) 


2  =  h(33) 


In  this  example,  the  adaptive  algorithm  does  a  much  better  job  of  dealing  with  the  zero 
coefficients  than  in  the  wideband  length-32  bandpass  filter  case.  Indeed,  in  the  present  case.  10 
truncated  coefficients  which  are  zero  in  value,  are  adjusted  by  the  algorithm  to  take  on  a  non¬ 
zero  value.  A  comparison  of  the  magnitude  responses  is  shown  in  Figure  11.  A  plot  of  the  error 
e(k)  versus  the  iteration  number  is  shown  in  Figure  12. 


g.  A  length-31  Bandreject  Filter 

FINITE  IMPULSE  RESPONSE  (FIR) 
LINEAR  PHASE  DIGITAL  FILTER  DESIGN 
REMES  EXCHANGE  ALGORITHM 

FILTER  LENGTH  =  31 
**•••  UNIT-SAMPLE  RESPONSE  **••• 
h(  1)  =  .55182726E-05  =  h(  31) 
h(  2)  =  -.26521932E-01  =  h(  30) 
h(  3)  =  .35114622E-04  =  h(  29) 
h(  4)  =  -.10156000E-04  =  h(  28) 
h(  5)  =  -.22336 191 E-04  =  h{  27) 
h(  6)  =  .44154967E-01  =  h(  26) 
h(  7)  =  -.66720404 E-04  =  h(  25) 
h(  8)  =  .4568561  lE-04  =  h(  24) 
h(  9)  =  -.68394425E-05  =  h(  23) 
h(  10)  =  -. 934690 17E-01  =  h(  22) 
h(  11)  =  .97577121E-04  =  h(  21) 
h(  12)  =  -.82618001E-04  =  h(  20) 
h(  13)  =  .35329113E-04  =  h(  19) 
h(  14)  =  .31394833E-1-00  =  h(  18) 
h(  15)  =  -.77643091  E-04  =  h(  17) 
h(  16)  =  .50010198E-(-00  =  h(  16) 
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Band  1 

Band  2 

Band  3 

Lower  band  edge 

0.0 

0.15 

0.42 

Upper  band  edge 

0.1 

0.35 

0.50 

Desired  value 

1.0 

0.0 

1.0 

Weighting 

1.0 

1.0 

1.0 

Deviation 

0.0237675 

0.0237675 

0.0237675 

Deviation  in  dB 

0.2040268 

-32.4803274 

0.2040268 

The  truncated,  6-bit  coefficients, 


multiplied  by  2^=  32  are 


h(  1)  =  0  =  h(31) 

h(  2)  =  0  =  h(30) 

h(  3)  =  0  =  h(29) 

li(  4)  =  0  =  h(28) 

h(  5)  =  0  =  h(27) 

h(  6)  =  1  =  h(26) 

h(  7)  =  0  =  h(25) 

h(  8)  =  0  =  h(24) 

h(  9)  =  0  =  h(23) 

h(10)  =  -2  =  h(22) 

h(ll)  =  0  =  h(21) 

h(12)  =  0  =  h(20) 

h(13)  =  0  =  h(19) 

h(14)  =  10  =  h(18) 

h(15)  =  0  =  h(17) 

h(16)  =  16  =  h(16) 

The  optimal,  6-bit  coefficients,  multiplied  by  2^=  32,  are 

h(  1)  =  0  =  h(31) 

h(  2)  =  -1  =  h(30) 

h(  3)  =  0  =  h(29) 

h(  4)  =  0  =  h(28) 

h(  5)  =  0  =  h(27) 

h(  6)  =  1  =  h(26) 

h(  7)  =  0  =  h(25) 
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h(  8) 

-1 

= 

h{24) 

h(  9) 

= 

0 

= 

h(23) 

h(10) 

= 

-4 

= 

h(22) 

h(ll) 

= 

0 

h(21) 

h(12) 

= 

0 

= 

h(20) 

h(13) 

= 

0 

= 

h(19) 

h(14) 

= 

10 

= 

h(18) 

h(15) 

= 

0 

=r 

h(17) 

h(16) 

= 

16 

h(16) 

A  comparison  of  the  magnitude  responses  is  shown  in  Figure  13.  A  plot  of  the  error  e(k)  versus 
the  iteration  number  is  shown  in  Figure  14. 


VII.  DESIGN  OF  FINITE  WORDLENGTH  FIR  DIGITAL  FILTERS  WITH  TAP  WEIGHT 
CIRCUITRY  DEFECTS 

a.  The  adaptive  modeling  scheme  may  also  be  utilized  to  design  optimal  finite-  wordlength  FIR 
filters  for  the  ACT  analog  transversal  filter  with  tap  weight  circuitry  defects.  Because  of  these 
defects,  the  ACT  hardware  may  inaccurately  represent  the  optimal  finite-wordlength  coefficient 
values.  The  same  algorithm  used  to  design  optimal  finite-precision  FIR  filters  (assuming  no 
tafHweight  circuitry  defects)  may  be  used,  with  the  following  modifications.  Equations  (3)  and 
(4)  of  the  finite-precision  least-mean  square  algorithm  must  be  modified  according  to 


€(k)=d(k)-wjx(k) 
w^j(k-f  l)=w^(k)-(-D|Q|pe(k)x(k)}| 


(14) 

(15) 


For  computer  simulation,  equation  (15)  is  interpreted  as  follows.  The  operator  Q{  }  transforms 
the  infinite-precision  filter  coefficients  (represented  by  floating  point  numbers  on  the  computer) 
into  a  second  set  of  floating  point  numbers  (the  quantized  coefficients),  equal  in  value  to  the 
quantization  level  closest  to  the  infinite-precision  coefficient  value.  The  defect  operator  D{  } 
transforms  these  quantized  coefficients  into  a  third  set  of  floating  point  numbers  which  are  the 
quantized  coefficient  values  actually  seen  by  the  hardware.  These  are  dubbed  the  defective 
coefficients,  and  are  denoted  by  the  subscript  d.  The  defective  ACT  hardware  has  the  effect  of 
merely  shifting  some  of  the  quantized  coefficient  values  from  one  quantization  level  to  an 
entirely  different  level;  the  quant. zation  step  does  not  change. 
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b.  If  the  tap  weight  circuitry  defects  in  the  ACT  programmable  transversal  filter  occur 
symmetrically,  so  as  not  to  negate  the  symmetry  of  the  filter  coefficients  as  represented 
internally  by  the  ACT  hardware,  then  equations  (11),  (12)  or  (13)  can  be  employed  in  lieu  of 
equation  (15)  to  ensure  linear  phase.  The  Q  and  D  operators  should  be  introduced  in  these 
equations  in  the  same  manner  as  in  equation  (15)  with  the  starting  weights  set  equal  to  the 
defective  coefficients. 

c:  Table  1  below  lists  the  known  defects  in  the  ACT  hardware  [18).  Tap  weights  are  numbered 
starting  with  unity;  the  first  bit  denotes  the  most  significant  bit  (MSB),  the  fifth  bit  denotes  the 
least  significant  bit;  the  sixth  bit  is  reserved  for  the  sign  bit. 


TABLE  1:  Defective  Tap  Weights  in  the  ACT  [18] 


Bit# 

Fault 

35 

5 (LSB) 

unknown 

36 

2  (MSB-1) 

always  off 

38 

1  (MSB) 

always  on 

57 

4  (MSB-3) 

doesn’t  turn  completely  on 

64 

1  (MSB) 

always  off 

for  n=l,..,32 

5 (LSB) 

always  off 

Note  that  for  filters  with  odd  length  (less  than  length  35),  symmetry  is  always  maintained; 
however,  because  of  the  last  defect  in  Table  1,  symmetry  is  not  maintained  for  any  even-length 
filter.  Since  all  that  is  known  about  tap  weight  57  is  that  it  fails  to  turn  on  completely,  it  is 
assumed  to  turn  on  halfway  in  the  computer  model. 

VIII.  TESTING  OF  THE  FINITE  WORDLENGTH  Elfi  FILTER  DESIGN  ALGORITHM 
WITH  TAP  WEIGHT  CIRCUITRY  DEFECTS 

a.  The  test  examples  are  now  used  to  assess  the  effectiveness  of  the  proposed  adaptive  modeling 
technique  for  the  cancellation  of  errors  due  to  tap  weight  circuitry  defects  in  the  ACT 
programmable  transversal  filter. 

b.  A  length-21  Low-pass  Filter 

The  truncated,  6-bit  coefficients,  as  represented  internally  by  the  ACT  transversal  filter  with  its 
tap  weight  ci.  uitry  defects,  multiplied  by  2^—  32  are 


h(  1) 

0 

h(21) 

h(  2) 

= 

0 

= 

h(20) 

h(  3) 

= 

-1 

h(19) 

h(  4) 

= 

0 

= 

h(18) 

h(  5) 

= 

0 

= 

h(17) 

h(  6) 

= 

0 

= 

h(16) 

h(  7) 

= 

1 

= 

h(15) 

h(  8) 

= 

0 

= 

h(14) 

h(  9) 

= 

-4 

= 

h(13) 

h(10) 

= 

8 

= 

h(12) 

h(ll) 

= 

22 

= 

h(ll) 

The  optimal,  6-bit  coefficients,  obtained  using  the  adaptive  algorithm  in  the  presence  of  tap 
weight  circuitry  defects,  multiplied  by  2^=  32  are 


h(  1)  =  1  =  h(21) 

h(  2)  =  2  =  h(20) 

h(  3)  =  -I  =  h(19) 

h(4)  =  0  =  h(18) 

h(5)  =  1  =  h(17) 

h(  6)  =  -2  =  h(16) 

h(  7)  =  1  =  h(15) 

h(  8)  =  0  =  h(14) 

h(  9)  =  -5  =  h(13) 

h(10)  =  8  =  h(12) 

h(ll)  =  22  =  h(ll) 


A  comparison  of  the  magnitude  responses  is  shown  in  Figure  15.  A  plot  of  the  error  e(k)  versus 
the  iteration  number  is  shown  in  Figure  16. 

c.  A  length-20  Low-pass  Filter 

The  truncated,  6-bit  coefficients,  as  represented  internally  by  the  ACT  transversal  filter  with  its 
tap  weight  circuitry  defects,  multiplied  by  2^=  .32  are 

h(  1)  =  1 

h(2)  =  0 
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h(3)  =  -I 

h(  4)  =  0 

h(5)  =  -1 

h(  6)  =  0 

h(7)  =  2 

h(  8)  =  -2 

h(  9)  =  -1 

h(10)  =  18 

h(ll)  =  18 

h(12)  =  0 

h(13)  =  -2 

h(14)  =  2 

h(15)  =  0 

h(16)  =  0 

h(17)  =  1 

h(18)  =  0 

h(19)  =  0 

h(20)  =  0 

Since  the  defects  in  the  tap  weight  circuitry  are  such  that  the  symmetry  of  the  filter  coefficients 
is  negated,  strictly  speaking  equations  (11),  (12)  or  (13)  no  longer  apply.  Yet  because  these 
equations  are  necessary  to  ensure  linear  phase  filters  they  are  used  just  the  same,  although  the 
finite  wordlength  coefficients  obtained  using  them  will  not  be  optimal.  These  6-bit  coefficients, 
obtained  using  the  adaptive  algorithm  in  the  presence  of  tap  weight  circuitry  defects,  multiplied 
by  2®  =  32  are 

h(  1)  =  2  =  h(20) 

h(  2)  =  1  =  h(19) 

h(  3)  =  -2  =  h(18) 

h(  4)  =  2  =  h(17) 

h(  5)  =  0  =  h(16) 

h(  6)  =  -1  =  h(15) 

h(  7)  =  2  =  h(14) 

h(  8)  =  -3  =  h(13) 

h(  9)  =  -2  =  h(12) 

h(10)  =  18  =  h(ll) 
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A  comparison  of  the  magnitude  responses  is  shown  in  Figure  17.  A  plot  of  the  error  e(k)  versus 
the  iteration  number  is  shown  in  Figure  18. 

d.  A  length-21  Bandpass  Filter 

The  optimal  coefficient  values  of  this  particular  filter  are  unaffected  by  the 
defects;  therefore,  it  is  not  necessary  to  redesign  the  filter  taking  into 
circuitry  defects. 

e.  A  length-32  (Wide-band)  Bandpass  Filter 
The  truncated,  6-bit  coefficients,  as  represented  internally  by  the  ACT  transversal  filter  with  its 
tap  weight  circuitry  defects,  multiplied  by  2®=  32  are 


h(  1)= 

0 

h(  2)  = 

0 

h(  3)  = 

0 

h(  4)  = 

0 

h(  5)  = 

0 

h(  6)  = 

0 

h(  7)  = 

0 

h(  8)  = 

0 

h(  9)  = 

-I 

h(10)  = 

0 

h(ll)  = 

2 

h(12)  = 

0 

h(13)  = 

2 

h(14)  = 

-2 

h(15)  = 

-9 

h(16)  = 

10 

h(17)  = 

10 

h(18)  = 

-8 

h(19)  = 

-3 

h(20)  = 

2 

h(21)  = 

0 

h(22)  = 

2 

h(23)  = 

0 

h(24)  = 

0 

ap  weight  circuitry 
.  c  nt  tap  weight 
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h(25)=  0 
h(26)=  0 
h(27)=  0 

h(28)=  0 
h(29)=  0 
h(30)=  0 
h(31)=  0 
h(32)=  0 

Because  the  filter  length  is  even,  defects  in  the  tap  weight  circuitry  are  such  that  the  symmetry 
of  the  filter  coefficients  is  negated,  strictly  speaking  equations  (11),  (12)  or  (13)  no  longer  apply. 
Yet  because  these  equations  are  necessary  to  ensure  linear  phase  filters  they  are  used  just  the 
same,  although  the  finite  wordlength  coefiicients  obtained  using  them  will  not  be  optimal. 
These  6-bit  coefficients,  obtained  using  the  adaptive  algorithm  in  the  presence  of  tap  weight 
circuitry  defects,  multiplied  by  2^  =  32  are 


h(  1)=  0 

=  h(32) 

h(  2)=  0 

=  h(31) 

h(3)=  0 

=  h(30) 

h(  4)=  0 

=  h(29) 

h(  5)=  0 

=  h(28) 

h(  6)=  0 

=  h(27) 

h(7)=  0 

=  h(26) 

h(  8)=  0 

=  h(25) 

h(  9)=  0 

=  h(24) 

h(10)=  0 

=  h(23) 

h(ll)=  2 

=  h(22) 

h(12)=  0 

=  h(21) 

h(13)=  4 

=  h(20) 

h(14)=  -3 

=  h(19) 

h(15)=  -10 

=  h(18) 

h(16)=  12 

=  h(17) 

A  comparison  of  the  magnitude  responses  is  shown  in  Figure  19.  A  plot  of  the  error  e(k)  versus 
the  iteration  number  is  shown  in  Figure  20. 
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f.  A  length-64  (Narrow-band)  Bandpass  Filter 

The  truncated,  6-bit  coefficients,  as  represented  internally  by  the  ACT  transversal  filter  with  its 

tap  weight  circuitry  defects,  multiplied  by  2^  =  32 

are 


h(l)  =  0 
h(  2)  =  0 
h(  3)  =  0 
h(  4)  =  0 
h(  5)  =  0 
h(  6)  =  0 
h(7)  =  0 
h(  8)  =  0 
h(  9)  =  0 
h(10)  =  0 
h(ll)  =  0 

h(12)  =  0 
h(13)  =  0 
h(14)  =  0 
h(15)  =  0 
h(16)  =  0 
h(17)  =  0 
h(18)  =  0 
h(19)  =  0 
h(20)  =  0 
h(21)  =  1 

h(22)  =  0 
h(23)  =  -1 
h(24)  =  0 
h(25)  =  1 

h(26)  =  0 
h(27)  =  -1 
h(28)  =  2 
h(29)  =  2 
h(30)  =  -2 
h(31)  =  -2 
h(32)  =  2 
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< 


h(33) 

= 

•> 

h(34) 

= 

h(35) 

= 

-2 

h(36) 

= 

2 

h(37) 

= 

2 

h(38) 

= 

-17 

h(39) 

= 

-1 

h(40) 

= 

1 

h{41) 

= 

1 

h(42) 

= 

-1 

h(43) 

= 

-1 

h(44) 

= 

1 

h(45) 

= 

0 

h(46) 

= 

0 

h(47) 

= 

0 

h(48) 

= 

0 

h(49) 

= 

0 

h(50) 

= 

0 

h(51) 

= 

0 

h(52) 

= 

0 

h(53) 

= 

0 

h(54) 

= 

0 

h(55) 

= 

0 

h(56) 

0 

h(57) 

= 

0 

h(58) 

= 

0 

h(59) 

= 

0 

h(60) 

= 

0 

h(61) 

= 

0 

h(62) 

= 

0 

h(63) 

= 

0 

h(64) 

= 

0 

Since  the  defects  in  the  tap  weight  circuitry  are  such  that  the  symmetry  of  the  filter  coefficients 
is  negated,  strictly  speaking,  equations  (11),  (12)  or  (13)  no  longer  apply.  Yet  because  these 
equations  are  necessary  to  ensure  linear  phase  filters  they  are  used  just  the  same,  although  the 
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finite  wordlength  coefficients  obtained  using  them  will  not  be  optimal.  These  6-bit  coefficients, 
obtained  using  the  adaptive  algorithm  in  the  presence  of  tap  weight  circuitry  defects,  multiplied 
by  2^  =  32  are 


h(  1) 

= 

-7 

=  h(  64) 

h(  2) 

= 

2 

=  h(  63) 

h(  3) 

= 

4 

=  h(  62) 

h(  4) 

= 

4 

=  h(  61) 

h(  5) 

= 

-13 

=  h(  60) 

h(  6) 

= 

8 

=  h(  59) 

h(  7) 

= 

3 

=  h(  58) 

h(  8) 

= 

-18 

=  h{  57) 

h(  9) 

= 

12 

=  h(  56) 

h(10) 

= 

-1 

=  h(  55) 

h(ll) 

= 

-5 

=  h(  54) 

h(12) 

= 

-17 

=  h(  53) 

h(13) 

= 

10 

=  h(  52) 

h(14) 

= 

-12 

=  h(  51) 

h(15) 

= 

10 

=  h(  50) 

h(16) 

= 

12 

=  h(  49) 

h(17) 

= 

20 

=  h(  48) 

h(18) 

= 

-22 

=  h(  47) 

h(19) 

= 

31 

=  h(  46) 

h(20) 

= 

-24 

=  h(  45) 

h(21) 

= 

31 

=  h(  44) 

h(22) 

= 

12 

=  h(  43) 

h(23) 

= 

-2 

=  h(  42) 

h(24) 

= 

-27 

=  h(  41) 

h(25) 

= 

30 

=  h{  40) 

h(26) 

= 

2 

=  h(  39) 

h(27) 

= 

31 

=  h(  38) 

h(28) 

= 

9 

=  h(  37) 

h(29) 

= 

7 

=  h(  36) 

h(30) 

= 

-24 

=  h(  35) 

h(31) 

= 

-8 

=  h(  34) 

h(32) 

= 

-24 

=  h(  33) 
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A  comparison  of  the  magnitude  responses  is  shown  in  Figure  21.  A  plot  of  the  error  e(k)  versus 
the  iteration  number  is  shown  in  Figure  22. 

g.  A  iength-31  Bandreject  Filter 

The  truncated,  6-bit  coefficients,  as  represented  internally  by  the  ACT  transversal  filter  with  its 
tap  weight  circuitry  defects,  multiplied  by  2^  =  32  are 


h(  1) 

= 

0 

= 

h(31) 

h(  2) 

= 

0 

= 

h(30) 

h(  3) 

= 

0 

= 

h(29) 

h(  4) 

z= 

0 

= 

h(28) 

h(  5) 

= 

0 

= 

h(27) 

h(  6) 

= 

0 

= 

h(26) 

h(  7) 

= 

0 

= 

h(25) 

h(  8) 

= 

0 

= 

h(24) 

h(  9) 

= 

0 

= 

h(23) 

h(10) 

= 

-2 

= 

h(22) 

h(ll) 

= 

0 

= 

h(21) 

h(12) 

= 

0 

= 

h(20) 

h(13) 

= 

0 

= 

h(19) 

h(14) 

=: 

10 

= 

h(18) 

h(15) 

= 

0 

= 

h(17) 

h(16) 

= 

16 

= 

h(16) 

The  optimal,  6-bit  coefficients,  obtained  using  the  adaptive  algorithm  in  the  presence  of  tap 
weight  circuitry  defects,  multiplied  by  2^  =  32  are 


h(  1) 

= 

0 

=  h(31) 

h(  2) 

= 

0 

=  h(30) 

h(  3) 

= 

0 

=  h(29) 

h(  4) 

= 

0 

=  h(28) 

h(  5) 

= 

0 

=  h(27) 

h(  6) 

= 

0 

=  h{26) 

h(  7) 

= 

0 

=  h(25) 

h(  8) 

= 

0 

=  h(24) 

h(  9) 

= 

0 

=  h(23) 
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h(10)  = 

-6 

=  h(22) 

h(ll)  = 

0 

=  h(21) 

h(12)  = 

0 

=  h(20) 

h(13)  = 

0 

=  h(19) 

h(14)  = 

10 

=  h(18) 

h(15)  = 

0 

=  h(17) 

h(16)  = 

16 

=  h(16) 

A  comparison  of  the  magnitude  responses  is  shown  in  Figure  23.  A  plot  of  the  error  e(k)  versus 
the  iteration  number  is  shown  in  Figure  24. 

VIII.  RECOMMENDATIONS: 

a.  In  this  work,  an  application  of  an  adaptive  modeling  scheme  (originally  proposed  by  Stearns 
and  Widrow  [12]  — [13]  for  the  synthesis  of  FIR  filters),  is  implemented  digitally  to  design 
optimal,  finite  wordlength  FIR  filters.  The  algorithm  is  further  modified  to  design  optimal, 
finite  wordlength  FIR  filters  for  an  ACT,  programmable  analog  transversal  filter  with  known 
tap  weight  circuitry  defects.  However,  in  this  case,  the  method  is  most  effective  only  if  the  tap 
weight  defects  occur  symmetrically  so  as  not  to  negate  the  symmetry  of  the  filter  coefficients  as 
represented  internally  by  the  ACT  hardware.  Furthermore,  the  digital  algorithm  may  be  used 
to  design  optimal  finite  wordlength  FIR  digital  filters  directly  from  a  specified  set  of  frequency 
response  characteristics. 

Although  the  method  of  mixed-integer  programming  is  the  only  general  way  reported  in  the 
literature  for  optimal  finite-wordlength  coefficients  [2],  the  computational  efficiency  of  the 
proposed  algorithm  renders  it  useful  for  filter  lengths  that  would  ordinarily  be  precluded  with 
the  former  method.  Indeed,  the  bulk  of  this  work  was  done  using  only  a  desktop  PC  — XT  clone, 
equipped  with  a  math  coprocessor  chip. 

b.  The  present  work  has  been  restricted  to  the  design  of  finite  wordlength  FIR  digital  filters.  It 
is  recommended  that  the  work  be  extended  to  include  the  design  of  stable  HR  filters  with  finite 
wordlength.  Coefficient  quantization  is  a  far  more  serious  problem  in  HR  filters,  since  an  HR 
filter  with  infinite-precision  coefficients,  which  is  stable  by  design,  may  actually  manifest  itself 
as  an  unstable  filter  when  implemented  in  special  purpose  hardware.  Of  course,  in  both  FIR 
and  HR  filters,  coefficient  quantization  may  result  in  appreciable  deviation  in  the  frequency 
response  from  that  obtained  with  infinite-precision  coefficients  [17]. 
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c.  It  is  anticipated  that  the  performance  of  the  proposed  algorithm  would  be  much  improved  if 
the  number  of  bits  used  to  represent  the  infinite-precision  coefficients  were  increased.  This 
would  probably  reduce  the  number  of  “zero  tap  weights^  that  are  sometimes  encountered  when 
using  the  algorithm. 

F urther  improvement  in  performance  might  be  realized  if  the  single  adaptive  linear  combiner 
employed  in  the  present  work  is  replaced  by  a  neural  network. 
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Fig.  3:  Magnitude  Response  of 
Length-21  Loupass  Filter 
Rdaptation  Constant  =  1.632 
Number  of  Iterations  =  500 
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Fig.  13  Magnitude  Response  o-f 
Length- 31  Bands  top  Filter 
Rdaptation  Constant  --  1.0 
Number  ot  Iterations  =  1000 
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Abstract 


Reliability  prediction  of  microelectronic  devices  using  the  Finite  Element  Method  requires  accurate 
estimation  and  control  of  the  inherent  finite  element  discretization  error.  This  report  details  formal 
mathematical  approaches  by  which  the  finite  element  discretization  error  may  be  automatically 
estimated  in  a  computational  environment,  and  the  results  of  this  assessment  used  to  automati¬ 
cally  control  this  inherent  numerical  error.  To  this  end,  two  new  posteriori  finite  element  error 
estimators  and  automatic  adaptive  mesh  generation  algorithms  were  developed  and  implemented 
into  FORTRAN  codes  for  2-D  automatic  adaptive  mesh  refinement.  Both  methods  assess  the  dis¬ 
cretization  error  element  by  element  by  comparing  the  discontinuous  finite  element  scalar  function 
with  an  improved  piecewise  continuous  {C^  continuous)  scalar  function  obtained  through  postpro¬ 
cessing  of  the  finite  element  results.  The  two  error  estimators  are  distinguished  from  each  another 
by  the  manner  in  which  the  C®  continuous  scalar  function  is  obtained.  Results  are  presented  for 
2-D  elasticity  in  which  the  effective  stress  (von  Mises)  is  taken  as  the  scalar  function  field  and  the 
corresponding  norm  has  a  distortional  energy  interpretation.  Finally,  the  concept  of  adaptive  ac¬ 
curacy  is  introduced  in  a  /i-based  adaptivity  algorithm.  Examples  are  presented  which  demonstrate 
the  effectiveness  of  the  error  estimation  and  control  algorithms  as  implemented  in  a  computational 
system. 


1  Introduction 


The  hnite  element  method  is  the  most  widely  used  numerical  method  for  obtaining  approximate 
solutions  to  partial  or  ordinary  differential  equations  which  govern  complex  engineering  problems. 
The  Computer  Aided  Systems  Engineering  Branch  (RBES)  at  Rome  Air  Development  Center  has 
successfully  employed  the  finite  element  method  for  reliability  assessment  of  microelectronic  com¬ 
ponents  ([1],[2]).  Detailed  finite  element  modeling  and  analysis  of  semiconductor  chips,  leads,  weld 
joints,  etc.,  has  yielded  accurate  predictions  of  the  location  and  magnitude  of  critical  stresses  and 
temperatures  in  the  microelectronic  component  ([3]-[14]).  Based  on  these  results,  failure  modes 
can  be  predicted  and  the  mechanical  reliability  of  the  product  assessed.  While  there  is  no  sub¬ 
stitute  for  statistical  and  empirical  reliability  prediction  methods,  such  as  those  found  in  Military 
Standard  MIL-HDBK-217E,  finite  element  analysis  of  microelectronic  components  can  supplement 
these  reliability  prediction  methods  by  offering  a  deeper  understanding  of  the  physics  of  design- 
related  failure  modes.  With  this  insight  the  engineer  can  then  make  design  changes  to  improve  the 
reliability  of  the  device. 

The  finite  element  reliability  prediction  method  is  particularly  useful  as  a  reliability  design  tool  for 
proposed  new  technology  or  custom  devices  where  one  does  not  have  the  benefit  of  extensive  (or 
any)  experimental  data.  Often,  these  devices  are  quite  costly  and  time  consuming  to  develop  and 
manufacture.  It  is  critical,  therefore,  that  design-related  reliability  problems  be  revealed  and  cor¬ 
rected  early  in  the  prototype  design  phase,  before  expensive  tooling  and  processing  costs  have  been 
incurred.  Finite  element  analysis  of  the  device  can  give  the  designer  amd  engineer  the  knowledge 
needed  to  make  the  best  possible  design  decisions  in  the  absence  of  hard  experimental  data. 

However,  reliability  assessment  via  the  finite  element  method  has  been  greatly  limited  by  the  sub- 
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stantial  amount  of  labor-intensive  work  and  finite  element  modeling  expertise  required  for  au:curate 
finite  element  analysis.  Typically,  it  may  take  an  engineer  several  weeks  to  build  an  initial  detailed 
finite  element  model  of  the  microelectronic  device  on  the  computer.  Moreover,  the  numerical  re¬ 
sults  will  often  indicate  critical  regions  of  the  device  which  must  be  remodeled  in  greater  detail 
to  obtain  an  accurate  solution  for  these  regions.  Accordingly,  finite  element  reliability  assessment 
(FERA)  is  inherently  an  iterative,  and  currently  a  time-consuming,  process.  A  flow  diagram  of  this 
process  as  implemented  today  is  shown  in  Figure  1.  Several  iterations  are  usually  required  before 
accurate  results  can  be  obtained.  The  shaded  box  indicates  the  various  tasks  involving  significant 
humam  interaction.  Of  these,  the  task  of  building  geometric  models  and  generating  valid  finite 
element  meshes  are  typically  the  most  labor-intensive  and  time-consuming,  although  assessing  the 
mechanical  reliability  of  the  microelectronic  device  using  the  finite  element  results  can  also  be  a 
cumbersome  process. 

The  above  discussion  leads  to  the  following  observations.  First,  reliability  predictions  of  micro¬ 
electronic  devices  by  the  finite  element  method  are  meaningless  unless  the  finite  element  analysis 
itself  is  reliable.  Second,  reliable  finite  element  analysis  is  inherently  an  iterative  process.  In  the 
absence  of  empirical  data,  a  single  analysis  is  clearly  insufficient  for  guaranteeing  sufficiently  accu¬ 
rate  results.  Lastly,  the  utility  of  the  finite  element  method  as  a  design  and  reliability  prediction 
tool  is  greatly  limited  by  the  inefficient,  labor-intensive  iterative  process  of  analysis,  refinement, 
reanalysis,  etc. 

Obviously,  there  is  a  great  need  to  automate  this  iterative  process.  Fortunately,  research  over 
the  past  ten  years  has  begun  to  address  this  problem  through  the  development  of  various  formal 
algorithms  for  estimating  the  finite  element  discretization  error  and  for  automatically  refining  the 
mesh  based  on  these  error  estimates.  Methods  which  estimate  the  finite  element  discretization 
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error  based  on  the  analysis  results  are  called  A-posteriori  Error  Estimators.  The  process  of  refining 
the  mesh  either  locally  or  globally  based  on  a-posteriori  error  estimates  is  called  Adaptive  Mesh 
Refinement.  This  report  details  the  research  involved  in  a  one  year  project  on  automating  the 
process  of  converging  on  a  sufficiently  accurate  finite  element  solution. 

It  should  be  noted  that  the  process  of  actual  mesh  generation  has  been  greatly  facilitated  by  the 
emergence  of  powerful  automatic  mesh  generators  which  cah  now  generate  nonuniform  meshes  based 
on  specified  mesh  densities  at  specific  points.  The  advent  of  these  automatic  mesh  generators  has 
made  automatic  adaptive  mesh  refinement  possible  for  finite  element  codes. 

2  Objective 

The  objective  of  this  research  project  is  to  signicantiy  improve  and  increase  the  accuracy  and 
efficiency  of  finite-element  based  reliability  assessment  of  microelectronic  devices.  This  can  be  ac¬ 
complished  by  the  development  of  formal  error  estimation  and  adaptive  remeshing  algorithms,  the 
implementation  these  algorithms  into  computer  codes,  and  the  interfacing  of  these  codes  with  finite 
element  mesh  generation  codes,  such  as  FASTQ,  and  finite  element  analysis  codes,  such  as  NISA2, 
as  shown  in  Fig.  3.  In  this  manner  much  of  the  time-consuming  manual  effort  required  to  converge 
on  an  accurate  finite  element  solution  will  be  eliminated.  Furthermore,  the  computational  sytem 
will  provide  the  reliability  engineer  with  important  estimates  of  the  accuracy  of  the  numerical  sim¬ 
ulation.  This  will  result  in  greater  productivity  and  improved  finite  element  reliability  assessment 
of  microelectronic  devices. 

It  should  be  noted  that  the  objective  of  this  research  is  consistent  with  a  longterm  objective  of 
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automated  finite  element  reliability  assessment.  The  proposed  research  is  the  first  step  needed  to 
achieve  this  longterm  objective. 

3  Related  Work 

In  this  section  a  brief  sampling  of  related  work  in  a-posteriori  error  estimators  and  adaptive  mesh 
refinement  is  presented.  For  a  more  comprehensive  review  the  reader  is  referred  to  Shephard  [15] 
and  Babuska  et  al.  [16]. 

Turcke  and  McNeice  [17]  were  one  of  the  first  researchers  to  study  the  problem  of  assessing  the 
discretization  error.  The  authors  established  some  simple  informal  guidelines  for  using  the  strain 
energy  density  function  as  a  measure  of  the  discretization  error.  In  1977  Melosh  and  Marcal  [18] 
proposed  the  “specific  energy  difference”  (SED)  method  for  assessing  the  discretization  error.  If 
the  solution  has  converged,  then  the  strain  energy  differential  between  each  element  with  a  reduced 
degree  of  freedom  set  (i.e.  from  the  previous  mesh)  and  the  corresponding  “same”  element  with  a 
higher  degree  of  freedom  set  (i.e.  from  the  current  mesh)  must  be  zero.  Thus,  this  strain  energy 
differential  is  a  p-based  measure  of  the  discretization  error.  It  can  also  be  viewed  as  an  h-based 
error  analysis  method  if  the  corresponding  “same”  element  actually  represents  a  “set”  of  elements 
obtained  by  subdivision  of  the  element  from  the  previous  mesh.  To  avoid  two  separate  analysis, 
Melosh  and  Marcal  argued  that  this  strain  energy  differential  between  elements  of  different  meshes 
can  be  approximated  from  a  single  solution  as  the  difference  between  the  specific  strain  energy  at 
any  noncentroidal  point  and  at  the  centroid  of  the  element.  Along  a  similar  line,  Shephard  [19]  has 
proposed  an  adaptive  mesh  refinement  algorithm  based  on  the  strain  energy  density  function  and 
its  variation  of  values  within  the  finite  element  mesh.  Again,  the  discretization  error  is  measured 
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by  the  lack  of  uniformity  in  the  strain  energy  density  over  the  element  volume. 

Babuska  and  his  associates  have  offered  more  mathematically  rigorous,  residual-based  approaches 
to  error  analysis.  Babuska  and  Rheinboldt  [20]  derived  error  bounds  for  the  energy  norm  of  the 
error  based  on  the  residual  of  the  differential  equation.  Element  error  indicators  were  introduced 
as  a  means  of  determining  which  elements  must  be  refined.  Mesh  optimality  is  based  on  achieving 
equal  error  indicators  for  all  elements.  Subsequent  work  continued  to  develop  residual-based  error 
estimators  and  associated  adaptive  mesh  refinement  schemes  ([21], [22]). 

Other  residual-based  methods  include  the  work  of  Kelly,  Gago,  and  Zienkiewic2  ([23], [24]).  In 
an  attempt  to  reduce  the  computational  burden  involved  in  residual-based  error  estimates,  the 
authors  proposed  the  use  of  special  hierarchical  shape  functions.  The  hierarchical  shape  functions 
essentially  permit  efficient  p-based  error  estimates.  However,  a  common  fundamental  problem  of 
all  residual-based  methods  has  been  the  difficulty  of  correlating  the  residual,  as  measured  by  an 
integral  norm,  to  the  pointwise  error  in  either  the  primary  variable  or  its  derivatives,  such  as  stresses 
in  eleisticity. 

More  recently,  Zienkiewicz  and  his  colleagues  ([25]-[27])  have  derived  a  new  stress-based  error 
estimator  and  associated  adaptivity  algorithm.  The  method  involves  obtaining  a  global  least  squa.es 
fit  of  the  discontinuous  (C®  continuous)  finite  element  stress  field  with  a  piecewise  continuous  (C* 
continuous)  stress  field.  The  latter  stress  field  is  taken  as  an  approximation  to  the  true  stress  field, 
and  the  difference  between  the  two  tensor  fields,  as  measured  by  the  energy  or  L2  norm,  represents 
an  estimate  of  the  discretization  error.  The  authors  invoke  the  asymptotic  convergence  rate  of 
displacement-based  finite  elements  to  correlate  the  norm  of  the  error  in  the  element  stress  field  to 
the  finite  element  size,  thereby  deriving  an  adaptive  element  sizing  function  for  the  new  mesh. 
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Ainsworth  et  al  [28]  have  shown  that  the  Zienkiewicz  and  Zhu  error  estimator  is  effective  and  asymp¬ 
totically  exact  provided  that  the  exact  stress  boundary  conditions  are  imposed  on  the  higher  order 
stress  held.  However,  imposition  of  exact  stress  boundary  conditions  for  general  multi-dimensional 
problems,  while  straightforward,  involves  additional  computations.  Cauchy  stress  components  must 
be  transformed  to  boundary-based  normal/ tangential  coordinate  systems  for  all  boundary  finite  el¬ 
ements.  Moreover,  the  order  of  the  system  of  equations  generated  by  the  least  squares  fit  problem 
is,  in  general,  larger  than  the  original  finite  element  system  of  equations.  Thus,  the  algorithm  is 
computational  intensive,  unless  the  coefficient  matrix  for  the  least  square  ht  problem  is  diagonal¬ 
ized  as  recommended  by  Zienkiewicz  and  Zhu.  The  effect  of  this  diagonalization  or  lumping  of  the 
coefficient  matrix  on  the  effectiveness,  accuracy,  and  convergence  properties  of  the  algorithm  was 
not  been  explored  by  Ainsworth  et  al. 


4  Posteriori  Error  Estimators  for  h  Refinement 


To  improve  finite  element  based  reliability  predictions,  the  reliability  engineer  must  be  assured  that 
the  approximate  finite  element  solution  has  met  specified  accuracy  requirements.  In  general,  there 
are  three  sources  of  error  in  finite  element  analysis: 


•  truncation  and  roundoff  error:  This  error  is  inherent  in  any  numerical  method  due  to  the 
limited  precision  in  which  numbers  can  be  represented  on  digital  computers.  Normally,  this 
error  is  insignificant  in  finite  element  analysis  since  computations  are  carried  out  in  double 
precision  and  material  properties,  loads,  and  geometry  are  known  with  far  less  precision. 

•  modeling  error:  This  error  is  due  to  improper  modeling  assumptions,  such  as  assuming  a 
fixed  boundary  condition  when  in  fact  the  support  is  flexible.  Practicing  good  engineering 
judgement  and  verifying  results  against  assumptions  can  minimize  modeling  errors. 

•  discretization  error:  This  is  the  error  specific  to  the  finite  element  method  due  to  the  rep¬ 
resentation  of  a  continuous  object  and  the  field  solution,  such  as  the  temperature  distribution, 
in  a  piecewise  fashion  using  a  finite  number  of  elements.  Increasing  the  number  of  elements, 
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i.e.  reflaing  the  mesh,  decreases  these  errors,  provided  that  proper  elements  are  used.  This 
error  is  often  the  major  source  of  inaccuracies  in  finite  element  analysis  and  is  often  the  most 
difficult  to  assess. 

In  this  report  we  proposed  and  evaluated  two  new  error  estimators  for  measuring  the  finite  element 
discretization  error.  The  error  estimators  are  based  on  formal  mathematical  techniques  applied  to 
the  finite  element  analysis  results.  As  such,  they  are  termed  posteriori  error  estimators.  The  new 
error  estimators  and  corresponding  it  h  adaptive  h-refinement  algorithm  are  based  on  a  generalized 
scalar  energy  density  held,  or  in  the  case  of  elasticity,  on  the  effective  von  Mise’s  scalar  stress  field. 
The  fundamental  concept  of  these  error  estimators  is  similar  to  the  error  estimator  proposed  by 
Zienkiewicz  and  Zhu  [25]  in  that  the  error  estimator  is  based  on  the  comparison  of  a  discontinuous 
hnite  element  quantity  to  an  improved  piecewise  continuous  (C^  continuous)  quantity.  However, 
there  are  some  important  distinctions  which  we  will  elaborate  later. 

The  following  notation  convention  is  used  throughout  this  paper.  All  vector  quantities  are  enclosed 
by  curly  braces  {  }  and  matrices  by  square  brackets  [],  all  nodal  variables  are  represented  by  an 
overbar,  while  hnite  element  held  variables,  such  as  temperature  distribution  are  indicated  with  an 
overhat.  The  subscript  e  denotes  an  element  quantity. 

Let  {a}  denote  the  hnite  element  stress  tensor  held  which  is,  in  general,  discontinuous  across 
interelement  boundaries.  Let  {<7*}  denote  a  <7®  continuous  approximation  to  the  true  stress  held. 
An  estimate  of  the  solution  error  is  given  by  the  difference  of  these  two^sc^laf*  helds.  To  obtain 
an  expression  for  {cr*},  two  methods  were  examined:  The  Least  Squares  Fit  (LSF)  Method  and  a 
Statically  Equivalent  (SE)  Method. 
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4.1  Leaist  Squares  Fit  Method 

To  arrive  at  an  expression  for  {o*},  Zienkiewicz  and  Zhu  (25]  imposed  a  weighted  residual  equality 
on  the  stress  error  estimate,  {F}  =  {<r*}  -  {d}: 

-  {»))dV  =  {0)  (1) 

where  is  the  matrix  of  interpolating  polynomials  associated  with  the  {a*}  stress  vector  field. 
Following  the  standard  finite  element  procedure  used  to  formulate  C®  displacement  fields,  the  C® 
stress  field  is  expressed  in  terms  of  unknown  nodal  values  {d*}  using  known  nodal  interpolating 
(shape)  functions  (7V<,]; 

{„■)  =  ijv.i(r)  (2) 

Eq.  (1)  represents  a  system  o(  K  •  N  equations,  where  K  equals  the  number  of  Cauchy  stress 
components  (i.e.  3  for  2-D,  6  for  3-D),  and  N  is  the  number  of  nodes  in  the  finite  element  model. 
The  least  squares  interpretation  of  this  approach  can  be  seen  as  follows.  Let  the  functional  n({(T*}) 
denote  the  squared  Lj  norm  of  the  error  estimate: 

(3) 

By  virtue  of  Eq.  (2),  11  is  a  function  of  {d*},  the  unknown  nodal  values  of  the  projected  stress 
field.  By  substituting  for  {a*}  from  Eiq.  (2),  setting  the  variation  of  fl  to  zero  to  minimize  the 
error,  and  algebraically  manipulating,  one  can  derive  Ex}.  (1). 

From  Eqns.  (1)  and  (2),  {a')  can  be  found  to  be 

{a')^[N,][A\-^  jj^N„\^{a)dV  (4) 
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where 

[A]  =  J^[N„]^[N,]dV  (5) 

Zhu  and  Zienkiewicz  have  demonstated  the  effectiveness  of  this  stress  based  error  estimator,  and  re* 
cently  Ainsworth  et  Al.  [28]  have  provided  a  deeper  mathematical  interpretation  of  the  Zeinklewicz 
and  Zhu  error  estimator.  However,  there  aire  certain  disadvantages  of  this  method,  namely 

•  computationally  expensive,  unless  a  diagonal  approximation  for  [A]  is  employed. 

•  error  norm  estimates  computed  by  the  least  squares  method  do  not  provide  an  upperbound 
on  the  true  error  norms. 

The  least  squares  method,  if  fully  implemented,  is  computationally  expensive  because  the  order  of 
the  system  of  equations  which  must  be  factored  and  solved  is  3N  (2'D)  or  6N  (3-D),  where  N  is 
the  number  of  nodes.  In  contrast,  the  order  of  the  original  finite  element  system  of  equations  is 
2N  and  3N  for  2-D  and  3-D  problems,  respectively.  Thus,  the  computational  effort  required  for 
computing  an  error  estimate  is  actually  greater  than  that  required  for  computing  the  original  finite 
element  displacement  solution. 

The  lack  of  upperboundedness  of  the  error  norm  estimate  to  the  true  error  norm  is  a  direct  con¬ 
sequence  of  the  least  squares  fitting  of  the  discontinuous  finite  element  stress  field.  In  general,  in 
highly  stressed  regions  the  finite  element  solution  underestimates  the  stress  field.  Thus,  the  least 
squares  fit  of  the  finite  element  solution  does  not  improve  the  prediction  of  peak  stresses,  and  the 
error  estimator,  as  measured  by  an  integral  norm,  is  less  than  the  true  error  norm.  Moreover,  the 
higher  the  discretization  error,  the  greater  this  discrepancy  between  the  estimated  and  true  error 
norms.  This  condition  will  result  in  slower  convergence  (i.e.  more  mesh  refinement  iterations) 
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to  an  optimadly  refined  mesh  compared  to  that  achieved  by  an  error  norm  estimate  which  would 
upperbound  the  true  error  norm. 

The  computational  efficiency  of  the  error  analysis  can  be  significantly  improved  if  the  error  analysis 
is  based  on  a  scalar  function.  In  the  following  sections  we  present  two  new  error  estimators  which 
are  both  based  on  a  scalar  function  of  the  gradients  in  the  field  variables.  The  work  is  presented 
for  elasticity  problems  in  which  the  scalar  function  selected  is  the  von  Mises  or  effective  stress. 
However,  the  method  can  be  applied  to  any  scalar  function  which,  in  the  finite  element  solution, 
is  discontinuous  across  interelement  boundaries,  and  in  which  an  improved  scalar  function  is  C® 
continuous.  For  example,  in  thermal  analysis,  a  logical  scalar  function  is  the  thermal  energy  density 
which  is  a  function  of  the  discontinuous  heat  fluxes. 

In  elasticity  the  effective  or  von  Mises  stress  <t„  is  a  particularly  attractive  choice  for  four  reasons: 

1.  It  is  based  on  the  discontinuous  finite  element  stress  components. 

2.  A  C®  continuous  cr„  field  is  generally  an  improved  solution,  although  exceptions  do  exist. 

3.  All  elasticity  finite  element  codes  must  compute  because  of  its  failure-related  significance. 

4.  The  use  of  an  failure- related  stress  function  as  a  basis  of  error  analysis  permits  accuracy 
requirements  to  be  tightly  integrated  to  the  stress  state. 

The  last  point  should  be  emphasized.  The  Distortional  Energy  Failure  Theory  is  based  entirely 
on  von  Mises  stress  function  and  is  the  most  widely  accepted  failure  theory  for  ductile  materials. 
A  error  estimator  based  on  the  von  Mises  stress  function  enables  the  error  estimator  to  have 
engineering  sigmucance,  and  permits  the  concept  of  adaptive  accuracy  to  be  introduced.  Typically, 
an  optimal  mesh  is  defined  as  one  which  minimizes  either  the  total  error,  as  measured  by  an 
integral  norm,  or  produces  a  minimum  uniform  local  relative  error  distribution  for  a  fixed  number 


2-12 


of  degrees  of  freedom.  However,  from  a  practical  standpoint,  an  optimal  mesh  is  one  that  offers  a 
sufficiently  accurate  solution,  as  specified  by  the  user,  tn  the  critical  regions  of  the  design,  while 
allowing  for  less  accurate  solutions  in  less  critical  regions.  With  this  definition  of  mesh  optimality,  a 
dramatic  reduction  in  the  number  of  degrees  of  freedom  in  the  refined  mesh  can  typically  be  achieved 
compared  to  a  refined  mesh  in  which  the  accuracy  requirement  is  held  constant  throughout  the 
model.  Obviously,  special  care  must  be  exercised  for  problems  involving  large  deflections,  buckling, 
eigenvalue  extraction,  etc.  where  local  solution  inaccuracies  may  have  significant  effects  on  global 
finite  element  quantities. 

In  3-D,  the  von  Mises  stress  is  computed  from  the  Cauchy  stresses  as  follows: 

[(^*  -  +  (<7y  -  +  i^z  -  +  6  (rjy  -1-  -f-  (6) 

Since  Oy  is  a  function  of  the  discontinuous  finite  element  Cauchy  stress  components,  von  Mises 
stress  function  will  also  exhibit  interelement  discontinuity.  Thus,  the  discretization  error  E  can  be 
measured  by  0^  —  0’,  where  <r*  is  a  piecewise  '-^utinuous  von  Mises  stress  Held.  Zienkiewicz  and 
Zhu’s  least  squares  method  used  to  obtain  the  piecewise  continuous  stress  tensor  field  {<r*}  can  also 
be  applied  to  obtain  an  expression  for  <7*: 

=  (7) 

Jv 

where 

<  =  IW..IK)  (8) 

It  should  be  noted  that  Eq.  (7)  must  be  imposed  independently  on  subdomains  distinguished  by 
distinct  material  properties.  This  is  in  accordance  with  the  fact  that  the  effective  stress  may  in 
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reality  be  discontinuous  across  intermateriai  boundaries.  Thus,  imposition  of  continuity  across 
these  boundaries  would  violate  physical  principles  and  weaken  the  accuracy  of  the  error  estimator. 

Eq.  (7)  is  solved  to  yield 

<  =  (9) 

where 

[A]  =  J[N„.f[N„,]dV  (10) 


Note  that  since  <r„  is  a  scalar  function,  the  matrix  [A]  to  be  inverted  is  of  order  JV  only.  The 
Cholesky  decomposition  and  forward  and  back  substitution  of  a  symmetric  system  of  equations  of 
order  n  involves  (n^  +  n)/2  floating  point  operations  (FLOPS)  [29].  Thus,  the  ratio  of  FLOPS 
required  for  obtaining  the  stress  vector  field  {a*}  to  the  FLOPS  required  for  obtaining  the  scalar 
function  function  is  approximately  where  K  equals  3  for  2-D  and  6  for  3-D  problems. 
One  may  argue  that  this  computational  savings  is  offset  to  some  extent  by  the  FLOPS  required 
to  compute  <7„  via  Ek}.  (6).  However,  the  engineering  significance  of  cr„  dictates  its  computation 
irrespective  of  any  posteriori  error  analysis. 

If  a  diagonal  approximation  for  [A]  is  employed,  then  the  FLOP  ratio  of  the  two  methods  is 
approximately  K.  However,  the  theoretical  effect  of  a  diagonal  approximation  to  [A]  on  the  error 
estimator  has  yet  to  be  explored. 

Although  the  von  Mises  stress-based  error  estimator  obtained  by  the  least  squares  fit  approach 
provides  a  measure  of  the  discretization  error  and  is  computationally  efficient,  it  possesses  one 
notable  disadvantage.  It  underestimates  the  peak  von  Mises  stresses,  and  therefore  the  estimated 


2-14 


error  norms  underestimate  the  true  error  norms  for  most  critical  analysis  regions  of  the  finite 
element  model.  In  the  following  section,  we  propose  an  alternative  formulation  for  obtaining  o* 
and  an  improved  error  estimator.  The  two  approaches  will  be  compared  in  the  Examples  section. 


» 


» 


» 
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4.2  The  Statically  Equivalent  C®  Stress  Field  Method 

This  method  was  initially  proposed  by  Loubignac  et  Al.  (30)  to  improve  the  estimation  of  peak 
stresses.  Modifications  have  been  introduced  by  other  researchers  more  recently  ([31], [32]). 

The  static  equilibrium  equations  are  given  by 

[K]{D}  -  W  =  0  (11) 

where 

[K]  =  /  [Bf[E][B]dV  (12) 

Jv 

Thus 

( J^lBnElB]dV){D)  -  {«)  =  0  (13) 

We  note  that 

{i)  =  (£|(B|{C)  (14) 

Therefore  Eq.  (13)  also  implies  that 

[  [Bf{a}dV-{R}  =  0  (15) 

Jv 

Thus,  the  discontinuous  finite  element  stress  field  {d}  satisfies  static  equilibrium  by  virtue  of  Eq. 
(15).  If  {a}  is  replaced  with  a  continuous  stress  field  '{ct*},  Eq.  (15)  will  no  longer  be  satisfied 
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and  a  residual  will  exist  due  to  the  disatisfaction  of  static  equilibrium.  Let 

{fl,}  =  /  =  f;  /  [B]'^{y).dV  (16) 

Comparing  this  with  the  applied  load  vector,  the  residual  load  vector  {A^}  is  obtained.  Thus,  a 
Newton- Raphson  iteration  algorithm  may  be  employed  to  update  nodal  displacements  and  finite 
element  stresses  until  convergence,  i.e.  static  equilibrium  of  the  (7®  continuous  {<t*}  field,  is  achieved 
in  some  sense. 


=  E  /  [B]l{arjv 

(17) 

{ARY 

=  w-iJi.)' 

(18) 

[K]{ADy 

= 

(19) 

=  {C)'  +  {AC)‘ 

(20) 

(21) 

The  C®  continuous  stress  field  {o’*}  required  in  E»q.  (17)  may  be  obtained  by  simple  nodal  avera^ng 
of  {d}  stresses  (extrapolated  from  Gauss  points)  to  obtain  {d*}  and  then  interpolating  for  {a*} 
via  Eq.  (2).  To  adopt  this  method  to  our  effective  stress  based  error  estimator,  the  {<y*}  stress 
components  can  be  used  to  directly  compute  o’*  via  Ea|.  (6). 

An  appealing  feature  of  this  method  is  that  the  decomposed  global  matrix  [/if]  used  in  the  original 
finite  element  analysis  may  be  used  for  all  Newton-Raphson  iterations.  Furthermore,  this  method 
has  also  been  observed  to  overestimate  peak  stresses  with  the  magnitude  of  peak  stress  overestima¬ 
tion  being  proportional  to  the  coarseness  of  the  mesh  (i.e.  higher  discretization  errors).  Typically, 
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only  a  few  iterations  are  needed  before  peak  stresses  are  overestimated.  Therefore,  error  estimations 
based  on  <t*  computed  by  this  method  tend  to  be  conservative  (i.e.  overestimate  the  true  error) 
for  peak  stress  regions.  This  improves  the  convergence  characteristics  of  the  adaptivity  algorithm 
in  these  critical  regions. 

5  Assessing  the  Discretization  Error 

Ideally,  one  would  like  to  control  the  discretization  error  in  a  pointwise  sense,  since  the  ultimate 
objective  of  adaptive  mesh  refinement  is  to  force  the  pointwise  error  in  the  finite  element  solution 
to  be  acceptable  throughout  the  domain.  However,  in  practice,  norms  must  be  used  to  measure 
the  error  element  by  element  and  modify  the  mesh  accordingly. 

The  Lj  integral  norm  of  a  function  /  is  given  by 

11/(1, y, 2)112  =  ^J^fix,y,z)dV  *  (22) 

We  note  that  the  projected  field  <jl  is  a  continuous  field  which  minimizes  the  L2  norm  of  the 
difference  between  <7*  and  over  the  entire  domain.  Therefore,  a  rather  natural  measure  of  the 
estimated  discretization  error  for  element  e,  E(^a„)  is: 

-  '<112  =  (V.  -  '<)*  <iv]  (23) 

Note  that  we  have  slightly  modified  our  notation,  using  the  pre-superscript  e,  instead  of  the  sub¬ 
script  e,  to  donote  an  element  quantity. 

If  the  L2  norm  is  normalized  by  dividing  by  then  it  represents  an  integral  root  mean  square 
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(RMS)  value  of  the  function.  Thus,  an  RMS  measure  of  the  element’s  von  Mises  stress  field  is 

RMS{^a„)  =  r^]  '  (24) 

It  is  observed  that  the  term  in  parenthesis  in  Eq.  (24)  is  directly  proportional  to  the  element’s 
distortional  strain  energy.  Denoting  the  element’s  distortional  strain  energy  as  ‘DSE,  we  have  the 
following  relationships  between  the  distortional  strain  energy,  the  RMS  and  L2  norm  of 

‘DSE  <x  ‘V(RMSi‘(T„)f  =  \\‘<r^\\l  (25) 

Accordingly,  the  norm  of  the  element’s  von  Mises  stress  field  is  proportional  to  the  square  root 
of  the  element’s  distortional  strain  energy,  and  the  RMS  measure  of  the  element  von  Mises  stress 
field  is  proportional  to  the  squaire  root  of  the  element’s  average  distortional  strain  energy  density. 

Finally,  a  global  RMS  measure  of  the  von  Mises  stress  is  given  by 


GRMSM  = 

(26) 

(27) 

(28) 

where  m  equals  the  number  of  elements.  As  before,  we  identify  the  term  in  large  parenthesis  in  Eq. 
(26)  to  be  proportional  to  the  total  distortional  strain  energy  (TDSE),  yielding 

TDSE  «  V(GRMS(ojy  =  Ml  (29) 
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6  Adaptive  Mesh  Refinement 


An  adaptive  mesh  refinement  scheme  is  simply  an  algorithm  by  which  the  results  (i.e.  stress  or 
temperature  distribution)  of  a  finite  element  analysis  can  be  used  to  remesh  or  refine  the  finite 
element  model  to  improve  the  level  of  accuracy  of  the  solution.  Adaptive  meshing  techniques  can 
be  classified  into  three  categories: 


1.  r  Refinement:  In  this  approach  neither  the  connectivity  of  the  finite  element  mesh  or  the 
order  of  the  finite  elements  is  changed.  The  new  mesh  contains  the  same  number  of  nodes, 
elements,  and  nodal  connectivity  used  to  define  each  element.  However,  the  nodal  locations 
are  moved  to  obtain  a  more  optimal  solution.  The  repositioning  of  the  nodes  may  be  based 
on  a  local  error  estimate  or  on  the  minimization  of  the  total  potential  energy  with  respect  to 
the  nodal  coordinates. 

2.  h  Refinement:  This  is  the  most  common  type  of  mesh  refinement  scheme.  It  is  often 
practiced  in  an  intuitive  manner  by  analysts  without  any  formal  error  estimate  computations. 
Ideally,  error  estimates  are  used  to  increase  the  number  of  elements  and  nodes  locally  and/or 
globally  to  obtain  a  more  optimal  mesh.  The  order  of  the  element  (linear,  quadratic,  etc.) 
remains  unchanged. 

3.  p  Refinement:  This  is  a  relatively  new  method  where  the  number  of  elements  and  element 
shape  remain  the  same.  Instead,  the  order  of  the  element  is  increased  by  the  addition  of 
midside  and  interior  nodes.  Higher  order  elements  used  higher  order  polynomials  to  represent 
the  displacement  or  temperature  field.  The  accuracy  of  the  solution  improves  because  of  the 
additional  nodal  degrees  of  freedom  in  the  new  mesh. 


Each  method  has  its  own  advantages  and  disadvantages.  In  r  refinement,  the  most  optimal  mesh 
can  be  obtadned  for  a  fixed  number  of  degrees  of  freedom.  However,  there  is  no  guarantee  that 
the  number  of  degrees  of  freedom  in  the  mesh  is  sufficient  to  satisfy  the  accuracy  requirements 
of  the  user.  The  h  refinement  method  has  the  advantage  of  being  applied  locally  and/or  globally 
to  force  the  solution  error  to  within  acceptable  bounds.  However,  mesh  restructuring  is  required 
which  usually  dictates  the  need  for  a  robust  automatic  mesh  generator.  The  p  refinement  method 
has  shown  promising  results  compared  to  the  h  refinement  method.  However,  it  is  more  difficult  to 
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implement  and  requires  access  to  the  finite  element  source  code  which  is  not  required  by  the  other 
two  techniques. 


Recently,  a  powerful  2-D  automatic  mesh  generation  code  has  been  developed  at  Sandia  National 
Laboratory  called  FASTQ  and  currently  installed  on  the  Computer  Aided  Systems  Engineering 
Vax  computer  at  Rx>me  Air  Development  Center.  Due  to  the  availability  of  this  automatic  2-D 
mesh  generator  and  due  to  the  lack  of  source  code  access  to  commercial  finite  element  codes,  an  h 
refinement  adaptive  meshing  strategy  was  selected  for  this  research. 

The  discretization  error  is  assessed  using  element  norms.  For  this  we  have  employed  the  norm 
of  the  error  defined  in  Ek[.  (23).  Now,  the  strategy  is  to  force  this  measure  of  the  error  to  be 
sufficiently  small  relative  to  a  reference  value  in  order  to  achieve  the  desired  pointwise  accuracy  in 
Ov.  The  Lj  norm  of  the  projected  (i.e.  assumed  exact)  von  Mises  stress  field,  ||<Tv||2  is  taken  as  the 
reference  value.  Thus,  the  accuracy  criteria  is 

(30) 

where  ri  is  small.  However,  difficulty  lies  in  determining  whether  ||<Tv||2aorm  should  be  computed 
locally  or  globally.  If  this  norm  is  computed  locally  (i.e.  over  the  element  domain),  then  the 
adaptivity  algorithm  will  be  excessively  demanding  (25).  In  effect,  a  uniform  relative  accuracy 
requirement  will  be  imposed  for  all  elements  regardless  of  the  magnitude  of  the  element  stress.  The 
absolute  estimated  error  may  be  very  small  for  an  element,  but  because  the  element  is  understressed, 
11®<t*||2  is  also  small,  and  the  algorithm  wiU  demand  refinement  in  spite  of  the  small  absolute  error. 
On  the  other  hand,  if  ||<7*||2  in  Eq.  (30)  is  computed  globally  by  summing  up  ll'o-*|l2  for  ail  elements, 
then  Tj  loses  its  significance  as  the  relative  error  norm  ratio  for  each  element. 
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In  an  attempt  to  resolve  this  delimma,  an  adaptive  accuracy  criteria  is  introduced  which  can  yield 
“optimal”  meshes  from  a  practical  viewpoint.  Let  tj  denote  a  nominal  target  error  fraction  of  the 
global  L2  norm  of  the  stress  held.  Then,  the  following  accuracy  criteria  is  used: 


ii^;(v„)ib  =  ii*^v  -  v:ii,  <  *i7(iia:ii,)„ 


(31) 


where 


V  =  n 


(GRMSial) 
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K<)\ 
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(32) 

(33) 


where  a  is  a  constant  >  0  which  controls  the  degree  of  accuracy  adaptivity.  If  a  =  0,  then  a 
nonadaptive  accuracy  requirement  is  imposed.  It  is  observed  that  the  effect  of  Eq.  (32)  is  to 
demand  the  greatest  accuracy  for  the  element  with  the  highest  RMS  measure  of  and  to  relax 
the  accuracy  requirement  for  all  elements  where  RMS(‘<rl)  <  GRMS(<rl).  Practical  bounds  must 
be  placed  on  *tj  computed  by  Eq.  (32)  to  prevent  mesh  transitioning  problems.  For  example,  'q 
may  be  permitted  to  vary  only  from  0.01  to  0.20  with  a  nominal  value  of  0.05. 


Note  we  have  selected  the  maximum  element  L2  norm  of  er*  as  the  reference  value  in  Ek}.  (31)  to 
which  the  element  norm  of  the  error,||E(^<7„)||2,  are  compared.  This  choice  represents  a  compromise 
between  a  purely  element- based  reference  value  and  a  purely  global-based  reference  value.  Numer¬ 
ous  numerical  experiments  based  on  local,  global,  and  local/global  reference  values  indicated  this 
choice  avoided  the  problem  of  excessive  mesh  refinement  in  understressed  regions,  while  retaining 
the  ability  of  the  mesh  to  converge  quickly  to  sufficiently  accurate  solutions  in  highly  stressed  re¬ 
gions.  In  a  similar  manner  we  have  found  that  an  accuracy  adaptivity  constant  of  a  »  2.0  yields 
good  results  in  terms  of  adapting  the  accuracy  of  the  solution  to  the  rate  of  change  of  the  stress 
field.  However,  more  work  is  clearly  needed  in  this  area. 
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While  the  error  norm  ratio  will  not  bound  the  pointwise  error  £{0^),  the  pointwise  error 

will  generally  converge  faster  than  the  error  norm  ratio  as  the  mesh  is  refined.  Thus,  the  strategy 
is  to  use  the  error  norm  ratios  as  a  basis  for  adaptivity,  while  monitoring  the  relative  estimated 
pointwise  errors  in  a„  at  the  peak  stress  points  in  the  mesh. 

An  A-Based  Adaptive  Remeshing  Scheme: 

Let  e  denote  the  right  hand  side  of  Eq.  (31).  Define  the  error  ratio  ^  as 

(34) 

e  e 

U  >  1»  then  the  element  size  he  is  decreased.  If  <  1,  then  we  increase  the  element  size  he.  If 
^  =  1  for  all  elements,  then  the  desired  adaptive  accuracy  requirement  has  been  achieved. 

To  obtain  appropriate  values  by  which  to  increase  or  decrease  element  size  h,  the  asymptotic 
convergence  rate  criteria  is  invoked  at  the  element  level.  If  the  order  of  the  assumed  polynomial 
used  to  approximate  the  field  variable  is  p,  then  one  can  expect  a  convergence  rate  of  the  field 
variable  of  order  since  higher  order  terms  have  been  omitted  from  the  assumed  field.  Similarly, 
the  convergence  rate  for  derivatives  of  the  field  variable,  such  as  strains  or  temperature  gradients, 
will  have  a  convergence  rate  of  order  where  I  is  the  order  of  differentiation  required  to 

obtain  the  derivative  variables.  One  can  expect,  then,  that  the  convergence  rate  of  E(‘(7„)  will  be 
of  order  Since  the  Lj  norm  of  is  given  by  the  square  root  of  the  integral  of  fJ(*<7„), 

it  is  argued  that  the  convergence  rate  for  ||E(*<t„)H2  is 

CRiWECaM  «  [ / =  (/i2(p+i-/)+i)i/2  (35) 

For  unear  elements  and  2D  and  3D  elasticity,  p  =  I  and  /  =  1,  yielding 


CR{\\E{‘aM  «  h^f^ 


(36) 


Denoting  the  current  mesh  with  the  pre-subscript  i,  the  requirement  that  =  1  with  Eq.  (36) 


yields 


Thus, 


For  linear  elements,  one  has 


H  e  ?h(2p-n)/2 

=?  f  = 


‘+1^  et2/3 


7  Automatic  Adaptive  /i-refinement  Mesh  Generation 


Recently,  a  new  automatic  mesh  generation  technique  has  been  developed  at  Sandia  National  Lab¬ 
oratories  which  is  ideally  suited  for  adaptive  h- refinement.  The  technique,  called  “paving”  in  2-D 
and  “plastering”  in  3-D,  uses  a  boundary  description  of  the  geometry  and  user-specified  element 
sizing  functions  along  the  boundary  segments  to  generate  all  quadralateral  meshes.  Essentially, 
rows  of  elements  are  laid  down,  element  by  element,  in  a  counter  clockwise  direction  around  the 
exterior  perimeter  preceding  inward  and  in  clockwise  directions  around  interior  perimeters  (i.e. 
holes)  preceding  outward.  Sophisticated  algorithms  are  employed  to  transition  between  different 
element  sizes  and  to  correct  the  mesh  where  rows  of  elements  overlap.  The  meshing  algorithms  are 
implemented  in  a  large  FORTRAN  code  called  FASTQ  (FAST  Quadralaterals).  The  reader  is  re¬ 
ferred  to  Reference  [33]  for  more  details  of  this  powerful  mesh  generation  technique  for  quadrilateral 
meshes. 
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Because  elements  are  inserted  into  the  mesh  one  by  one,  the  algorithm  is  well  suited  for  adaptive 
meshing.  Details  of  this  algorithm  will  be  forthcoming. 


8  Implementation 

The  two  error  analysis  approaches  ot  the  previous  section  were  implemented  into  two  separate 
in-house  finite  element  codes,  called  ERRFEMl  and  ERRFEM2.  The  FASTQ  2-D  automatic 
mesh  generation  code,  discussed  above,  was  modified  to  implement  adaptive  meshing  according  to 
Eq.  (39).  Subroutines  were  also  added  to  FASTQ  for  automatic  generation  of  complete  input  data 
files  for  ERRFEMl  and  ERRFEM2  based  on  user-supplied  data  through  interactive  session. 

Note  that  the  error  estimation  algorithms  have  been  implemented  into  in-house  finite  element 
codes,  instead  of  implemented  as  separate  stand-alone  codes  which  interface  to  commercial  finite 
element  codes  as  ori^nally  planned  (Fig.  2).  This  was  necessary  due  to  the  technical  difficulties  of 
interfacing  to  a  commercial  code,  the  research  nature  of  this  work,  and  the  time  constraints  on  the 
project. 

The  sequence  of  activities  with  the  error  analysis  and  automatic  adaptive  mesh  generation  is  illus¬ 
trated  in  Fig.  3.  First,  the  user  defines  an  initial  boundary  representation  of  the  2-D  geometry  with 
associated  meshing  data  either  interactively  in  FASTQ  or  through  the  preparation  of  a  FASTQ 
input  data  file.  Meshing  data  consists  of  simply  the  type  of  element  (four-,  eight-,  or  nine-noded), 
the  number  of  elements  along  each  given  boundary  segment,  the  uniformity  of  element  spacing 
along  each  boundary  segment,  and  the  type  of  mesh  generation  technique  (i.e.  paving  technique). 
Next,  FASTQ  uses  this  information  and  the  paving  algorithm  to  automatically  generate  an  initial 


2-24 


finite  element  mesh.  The  user  then  interactively  requests  that  an  input  file  be  generated  for  the 
finite  element/error  analysis  codes.  The  system  then  prompts  the  user  for  additional  finite  element 
modeling  data,  such  as  material  properties,  and  loading  and  boundary  conditions.  After  speciiicar 
tion,  the  input  file  for  ERRFEMx  is  written,  as  well  as  a  FASTQ  binary  mesh  database  file  and 
a  FASTQ  input  file. 

The  finite  element /error  analysis  code  is  then  executed.  Execution  causes  the  input  file  to  be  read, 
a  finite  element  analysis  to  be  performed,  error  ratios  to  be  calculated  for  each  element,  and  the 
error  ratio  results  to  be  appended  to  the  end  of  the  binary  mesh  database  file.  FASTQ  is  invoked, 
and  the  user  selects  the  remeshing  option  under  the  mesh  processing  module.  The  system  then 
prompts  the  user  for  the  name  of  the  FASTQ  input  data  file  and  the  binary  mesh  database  file. 
The  paving  mesh  generation  algorithm  is  executed  where  new  elements  are  sized  based  on  the  error 
ratios  contained  in  the  database  file  and  old  element  sizes  according  to  Eq.  (39).  The  geometry 
definition  information  contained  in  the  FASTQ  input  data  file  is  used  to  ensure  that  remeshing  is 
based  on  the  originally  defined  geometry,  and  not  the  geometry  defined  by  the  previous  mesh. 

Note  that  this  process  could  have  been  streamlined  considerably  by  merging  the  FASTQ  code 
with  the  ERRFEM  codes.  However,  the  resulting  loss  of  modularity  would  have  hindered  future 
applications  involving  stand  alone  commercial  finite  element  codes,  stand  alone  error  analysis  codes, 
and  stand  alone  automatic  adaptive  mesh  generation.  For  this  reason  it  was  decided  to  keep 
adaptive  mesh  generation  and  finite  element  and  error  analysis  separate.  Furthermore,  work  is 
currently  underway  on  implementing  the  error  analysis  algorithms  in  a  stand  alone  code  which  will 
interface  to  FASTQ  for  mesh  generation  and  NISA2  for  finite  element  analysis. 
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9  Examples 


A  comparison  of  the  proposed  error  estimators  and  h-based  adaptivity  algorithms  is  illustrated  with 
the  following  2-0  examples  involving  four-noded  bilinear  quadrilateral  elements.  The  first  example 
presented  is  the  2-D  plane  stress  problem  of  a  thin  plate  with  hole  under  tension  (Fig.  4(a)).  The 
second  example  studied  is  the  endloaded  cantilever  beam  shown  in  Fig.  4(b). 

The  initial  coarse-mesh  finite  element  model  representing  one  fourth  of  the  plate  is  shown  in  Fig. 
5(a),  with  the  corresponding  nodal  averaged  <7„  stress  distribution  shown  in  Fig.  5(b),  where  the 
stress  distribution  has  been  nondimensionalized  by  the  applied  traction  oq.  The  relative  pointwise 
error  (RPWE)  is  defined  as 

rpwe  t-'  (KW-^v) 

(<^v)exact 

and  has  a  value  of  23.5%  at  the  critical  stress  point  (top  of  hole)  for  this  me*  H.  The  “exact”  won 
Mises  stress  value  (4.35)  was  obtained  using  a  fine  mesh  consisting  of  1400  elements,  1491  nodes, 
and  2940  active  degrees  of  freedom.  A  series  of  convergence  studies  established  that  this  mesh  was 
sufficiently  accurate  to  represent  the  “exact”  stress  distribution. 

In  Figures  5(c)-4  results  are  presented  for  the  least  squares  fit  (LSF)  method  for  obtaining  a*. 
A  nominal  target  accuracy  ratio  of  ?/  =  0.05  was  used  with  upper  and  lower  limits  of  0.01  and 
0.20  placed  on  the  adaptive  accuracy  ratio  *77.  In  Figs.  5(c)  we  show  the  distribution  of  adaptive 
accuracy  ratio  given  by  Eq.  32.  The  corresponding  error  ratio  distribution  given  by  Eq.  (34)  are 
plotted  in  Fig.  5(d).  The  maximum  error  ratio  is  4.86  corresponding  to  *77  =  0.014  and  occurs  at 
the  critical  stress  element. 
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Employing  the  new  element  size  adaptivity  function  given  by  E^.  (39)  and  the  “paving”  mesh 
generation  technique,  the  first  mesh  refinement  is  shown  in  Fig.  6(a).  To  avoid  possible  mesh 
transitioning  problems,  elements  with  error  ratios  less  than  one  were  not  allowed  to  increase  in 
size.  Again,  the  von  Mises  stress  distribution  is  shown  in  Fig.  6(b),  and  the  relative  pointwise 
error  at  the  critical  stress  point  is  now  0.10.  Adaptive  accuracy  distribution  and  corresonding  error 
ratio  distribution  are  shown  in  Figs.  6(c)  and  6(d).  Note  that  the  maximum  error  ratio  has  been 
reduced  from  4.86  to  3.30,  while  the  relative  pointwise  error  in  at  the  top  of  hole  has  decreased 
from  23.5%  to  10.0%.  As  expected,  the  pointwise  error  converges  faster  than  ♦^he  norm  of  the  error. 

Figs.  7(a)-7(d)  show  the  third  and  final  remesh  results  for  the  LSF  method.  The  relative  point- 
wise  error  at  the  critical  stress  point  and  the  maximum  error  ratio  are  now  and  0.045  and  1.40, 
respectively. 

Analogous  results  for  the  statically  equivalent  (SE)  von  Mises  stress  based  method  are  shown  in 
Figs.  8-10.  Again,  a  target  accuracy  of  t/  =  0.05  was  sought.  Six  Newton- Raphson  iterations  were 
performed  for  each  analysis  to  obtain  the  statically  equivalent  piecewise  continuous  von  Mises  stress 
field.  The  results  indicate  improved  convergence  characteristics  of  this  method,  as  evidenced  by 
the  higher  maximum  error  ratios  for  the  same  coarse  mesh  and  faster  convergence  of  the  pointwise 
percentage  error  at  the  critical  stress  point. 

The  effect  of  adaptive  accuracy  factor  a  in  Eq.  (32)  was  explored  by  setting  a  =  0  and  repeating 
the  analysis  using  the  statically  equivalent  stress  based  method.  The  result  was  poor  remeshes  due 
to  distorted  elements.  The  error  ratios  were  found  to  be  higher  at  the  distorted  elements  than  at 
the  critical  stress  region.  Thus,  refinement  occurred  at  understressed  but  distorted  element  regions 
initially.  After  several  remeshes,  refinement  did  proceed  to  the  critical  stress  region,  albeit  slowly, 
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but  the  target  accuracy  was  not  reached  before  the  error  ratio  converged  to  one.  The  implication 
of  this  observation  will  be  discussed  in  the  following  section. 

Results  for  the  plate  with  hole  problem  are  summarized  in  Fig.  II  which  illustrates  the  convergence 
characteristics  of  both  the  pointwise  error  and  the  error  norm  ratio  of  the  two  methods.  From  Fig. 
11(a)  one  can  see  the  superior  convergence  characteristics  of  the  SE  method  compared  to  the  LSF 
method  in  both  the  error  norm  ratio  (ENR)  lor  the  critical  element  and  the  relative  pointwise  error 
(RPWE)  at  the  critical  stress  point.  In  Fig.  11(b)  the  effect  of  adaptive  accuracy  on  convergence  is 
clearly  demonstrated.  The  nonadaptive  approach  exhibited  relatively  slow  convergence  characteris¬ 
tics.  The  increasing  slope  of  both  the  ENR  and  RPWE  curves  for  the  adaptive  accuracy  approach 
indicates  that  accuracy  is  increasing  at  a  faster  rate  than  the  rate  degrees  of  freedom  are  increasing, 
a  phenomenon  attributable  to  the  loc-.lized  mesh  refinement  in  the  vicinity  of  the  critical  stress 
point. 

In  the  beam  example  only  the  SE  method  was  used  to  obtain  the  improved  a*  stress  field.  The 
initial  uniform  finite  element  mesh  for  the  beam  problem  is  shown  in  Fig.  12(a).  For  convenience 
Poi'coa  ratio  was  set  to  zero.  The  won  Mises  stress  distribution,  nondimensionalized  by  the  applied 
traction  oo  is  shown  in  Fig.  12(b).  At  the  critical  stress  point  at  the  left  edge  on  the  top  or  bottom 
of  the  beam,  the  relative  pointwise  error  is  11.3%  for  this  coarse  mesh.  Figs.  12(c)  and  12(d)  show 
the  adaptive  accuracy  and  error  ratio  distributions  obtained.  Figures  13  and  14  present  the  results 
for  the  first  and  third  (final)  remeshes.  These  results  are  summarized  in  Fig.  15.  Note  the  excellent 
convergence  of  the  pointwise  error  to  the  target  accuracy  in  the  first  remesh. 
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10  Discussion 


In  the  plate  example  the  error  norm  ratios  for  the  critical  stress  element  were  found  to  be  less  than 
the  relative  pointwise  error  at  the  critical  stress  point  for  a  given  mesh.  The  opposite  was  true  for 
the  beam  problem.  This  result  is  to  be  expected  since  the  error  norm  does  not  bound  the  pointwise 
error.  Since  the  norm  of  the  error  reflects  an  “average”  error  over  the  element  volume,  one  can 
expect  in  areas  of  high  stress  concentration,  such  as  at  the  top  of  the  hole  in  the  plate,  the  relative 
pointwise  error  to  exceed  the  error  norm  ratio.  Therefore,  convergence  of  the  error  ratio  to  one 
is  not  a  sufficient  condition  for  obtaining  the  target  accuracy  in  a  pointwise  sense.  In  'r>ractice  we 
have  observed  that  imposing  an  adaptive  accuracy  requirement  according  to  £q.  32  has  proven  to 
be  effective  in  driving  the  critical  pointwise  error  to  be  within  the  tar'^et  accuracy  as  the  error  ratio 
approaches  unity.  However,  more  theoretical  and  numerical  work  is  needed  to  provide  an  improved 
correlation  in  some  sense  of  the  relative  pointwise  error  to  an  error  norm  ratio. 

Two  interesting  observations  are  noted  for  the  beam  pro^'em.  First,  by  comparing  Figs.  13(a), 
13(c),  and  13(d),  it  is  clear  that  the  SE  method  predicts  a  relatively  high  error  ratio  for  distorted 
elements.  Secondly,  the  final  remesh  shows  relatively  small  element  sizes  on  the  beam’s  neutral  axis 
where  the  y  boundary  conditi  'n  constraint  is  imposed.  Although  stresses  are  not  high  at  this  point, 
a  theoretical  discontinuity  exists  in  the  (7y  stress  across  the  interelement  boundary  at  this  point 
d’  the  reaction  force  applied  by  the  boundary  condition  constraint.  The  result  is  a  theoretical 
discontinuity  in  the  von  Mises  stress  field  at  this  point.  Therefore,  the  smoothed  continuous  von 
Mises  stress  field  at  this  point  does  not  represent  an  improved  solution.  Future  work  must  account 
for  such  situations  if  stress  based  error  estimators  are  to  be  reliably  used  for  automatic  adaptive 
mesh  refinement. 
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11  Conclusions 


Two  new  von  Mises  or  eiTective  stress  based  error  estimators  were  introduced  and  compared.  An 
effective  stress  based  error  estimator  offers  some  significant  advantages  over  residual  and  other 
stress-based  error  estimators.  First,  because  it  is  a  scalar  function,  the  error  estimator  is  com¬ 
putationally  efficient  to  compute.  Second,  it  provides  a  natural  means  by  which  the  concept  of 
adaptive  accuracy  can  be  introduced,  thereby  permitting  optimal  meshes  from  an  engineering  or 
failure-oriented  perspective.  Third,  because  the  error  estimator  is  based  on  a  scalar  function,  the 
method  is  easily  extendable  to  3-D  problems.  Fourth,  the  method  can  be  easily  generalized  for 
other  problem  domains  by  using  similar  scalar  functions  of  engineering  significance.  For  example, 
in  thermal  analysis,  the  thermal  energy  density  function  is  a  natural  scalar  function  to  use  as  a 
basis  for  error  estimation. 

Both  methods  demonstrated  convergence  of  the  critical  pointwise  error  for  the  test  case  problems 
studied.  The  statically  equivalent  error  estimator  method  was  seen  to  offer  superior  convergence 
characteristics  than  the  least  squares  fit  method.  However,  for  a  given  mesh  the  SE  method  is 
computationally  more  expensive  than  the  least  squares  fit  method,  especially  if  a  local  least  squares 
fit  method  is  employed  by  simply  averaging  element  stresses  extrapolated  to  nodes.  Thus,  the 
superior  convergence  characteristics  of  the  SE  method  is  at  the  expense  of  increase  computational 
costs  for  a  given  mesh.  However,  for  most  applications,  it  is  preferable  to  minimize  the  number 
of  mesh  refinements  needed  because  of  the  real  time  involved  and  computer  resources  needed  to 
manage  multiple  finite  element  models.  In  view  of  this  concern,  the  SE  method  is  more  attractive 
than  the  LSF  method.  A  judicious  combination  of  the  two  methods  may  afford  an  optimal  path  to 
a  sufficiently  accurate  solution. 
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12  Future  Work 


The  finite  element  and  error  analysis  codes  ERRFEMl  and  ERRFEM2  are  useful  research  tools 
for  quickly  implementing  and  testing  error  estimation  algorithms.  However,  to  be  of  greater  utility 
to  the  Air  Force,  the  error  estimation  algorithms  must  also  have  the  capability  to  accept  finite 
element  results  obtained  from  commercial  finite  element  codes  such  as  NISA2.  In  addition,  the 
two  error  estimation  algorithms  should  be  integrated  into  a  single  code.  Thus,  the  first  task  on  the 
agenda  is  to  bring  ERRFEMl  and  ERRFEM2  error  analysis  codes  together  into  a  single  finite 
element/error  analysis  computer  program  which  will  interface  to  FASTQ  data  files  for  mesh  data 
and  to  NISA2  output  files  for  analysis  data,  if  desired.  This  task  is  already  underway  and  is  70% 
completed  in  the  form  of  a  C  computer  code.  The  code  is  scheduled  to  be  completed  by  May  1, 
1991. 

Future  work  will  also  include  extension  of  the  error  estimation  algorithms  to  thermal  analysis 
problems.  Extension  to  thermal  analysis  is  critical  since  thermally  induced  stresses  and  strains 
often  result  in  microelectronic  device  reliability  problems.  Thermal  analysis  error  estimation  can 
be  achieved  by  replacing  the  von  Mises  stress  function  appearing  in  the  algorithms  with  a  scalar 
function,  such  as  thermal  energy  density,  of  the  heat  flux  vector.  However,  since  heat  flux  is  typically 
not  computed  at  element  gauss  points  by  commercial  finite  element  codes,  the  error  analysis  code 
will  be  modified  to  compute  the  heat  flux  distribution  from  finite  element  nodal  temperature  results. 
Thermal  finite  elements  also  must  be  added  to  the  code  so  that  the  code  will  have  a  stand-alone 
thermal  finite  element  analysis  capability.  This  implementation  is  quite  straightforward. 

Error  estimation  based  on  comparing  an  “improved”  piecewise  continuous  function  (i.e.  von  Mises 
stress)  with  the  finite  element  discontinuous  function  are  based  on  a  continuity  assumption.  It  is 
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theoretical  solution  may,  in  fact,  be  discontinuous.  One  example  is  the  von  Mises  stress  function 
across  intermaterial  boundaries.  Under  these  conditions,  the  in-plane  Cauchy  stress  components 
parallel  to  the  intermaterial  boundary  may  be  discontinuous,  which  results  in  a  theoretically  dis¬ 
continuous  <rv  stress  function.  Accordin^y,  one  of  the  major  focus  of  future  work  will  be  to  modify 
the  error  estimation  algorithm  to  handle  intermateiiai  boundaries  and  other  sources  of  stress  dis¬ 
continuities  (such  ais  point  loads  perpendicular  to  interelement  boundaries).  Conceptually,  the 
proposed  error  estimation  algorithms  can  be  easily  modified  to  account  for  these  circumstances. 
However,  implementation  presents  a  more  serious  challenge.  Since  intermaterial  boundaries  are 
quite  common  in  microelectronic  devices,  this  work  is  also  of  critical  importance. 

Thirdly,  the  work  must  be  extended  to  nonlinear  problems.  Again,  this  appears  to  be  a  straightfor¬ 
ward  task,  because  the  error  estimation  algorithm  can  be  applied  independent  of  time  and  loading 
history.  However,  for  path  dependent  nonlinear  problems,  the  effect  of  small  inaccuracies  in  the 
solution  at  previous  time  steps  may  have  a  significant  impact  for  the  current  time  step.  This  raises 
the  question  of  conditional  stability  and  its  relationship  to  accuracy  which  must  be  explored. 

Finally,  future  work  will  include  extension  of  the  error  estimation  and  adaptive  remeshing  scheme 
to  3-D  problems.  A  3-D  version  of  the  FASTQ  automatic  mesh  generation  code  is  scheduled  to  be 
released  in  September  1991.  Only  minor  modifications  are  needed  to  extend  the  error  estimation 
algorithm  to  3-D.  However,  it  is  expected  that  a  significant  amount  of  numerical  studies  will  be 
needed  to  study,  detect,  and  correct  automatic  adaptive  mesh  generation  problems  in  3-D. 
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Figure  Captions 


Fig.  1:  Reliability  Assessment  by  the  Finite  Element  Method 
Fig.  2:  Proposed  Research  Project 

Fig.  3:  Error  Analysis  Process  Using  FASTQ  and  ERRFEM  Codes 

Fig.  4(a):  Plate  with  a  Hole 

Fig.  4(b):  Cantilever  Beam  Under  Bending 

Fig.  5(a):  Initial  Coarse  Mesh  (16  Elements) 

Fig.  5(b):  Normalized  von  Mises  Stress  Distribution 
Fig.  5(c):  Adaptive  Accuracy  Ratio  Distribution,  LSF  Method 
Fig.  5(d):  Error  Ratio  Distribution,  LSF  Method 
Fig.  6(a):  First  Remesh,  LSF  Method  (29  Elements) 

Fig.  6(b):  Normalized  von  Mises  Stress  Distribution 

Fig.  6(c):  Adaptive  Accuracy  Ratio  Distribution,  LSF  Method 

Fig.  6(d):  Error  Ratio  Distribution,  LSF  Method 

Fig.  7(a):  Third  (Final)  Remesh,  LSF  Method  (72  Elements) 


2-36 


Fig.  7(b):  Normalized  von  Mises  Stress  Distribution 

Fig.  7(c):  Adaptive  Accuracy  Ratio  Distribution,  LSF  Method 

Fig.  7(d):  Error  Ratio  Distribution,  LSF  Method 

Fig.  8(a):  Adaptive  Accuracy  Ratio  Distribution,  SE  Method 

Fig.  8(b):  Error  Ratio  Distribution,  SE  Method 

Fig.  9(a):  First  Remesh,  SE  Method  (45  Elements) 

Fig.  9(b):  Normalized  von  Mises  Stress  Distribution 

Fig.  9(c):  Adaptive  Accuracy  Ratio  Distribution,  SE  Method 

Fig.  9(d):  Error  Ratio  Distribution,  SE  Method 

Fig.  10(a):  Second  Remesh,  SE  Method  (81  Elements) 

Fig.  10(b):  Normalized  von  Mises  Stress  Distribution 

Fig.  10(c):  Adaptive  Accuracy  Ratio  Distribution,  SE  Method 

Fig.  10(d):  Error  Ratio  Distribution,  SE  Method 

Fig.  11(a):  Convergence  Comparison  of  LSF  and  SE  Methods 

Fig.  11(b):  Convergence  Comparison  of  Nonadaptive  vs.  Adaptive  Accuracy 
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Fig.  12(a):  Initial  Coarse  Mesh,  SE  Method  (40  Elements) 

Fig.  12(b):  Normalized  von  Mises  Stress  Distribution 
Fig.  12(c):  Adaptive  Accuracy  Ratio  Distribution 
Fig.  12(d):  Error  Ratio  Distribution 
Fig.  13(a):  First  Remesh,  SE  Method  (74  Elements) 

Fig.  13(b):  Normalized  von  Mises  Stress  Distribution 
Fig.  13(c):  Adaptive  Accuracy  Ratio  Distribution 
Fig.  13(d):  Error  Ratio  Distribution 

Fig.  14(a):  Final  (Third)  Remesh,  SE  Method  (168  Elements) 

Fig.  14(b):  Normalized  von  Mises  Stress  Distribution 

Fig.  14(c):  Adaptive  Accuracy  Ratio  Distribution 

Fig.  14(d):  Error  Ratio  Distribution 

Fig.  15:  Convergence  of  the  Error  for  the  Beam  Example 
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Figure  1:  Reliability  Assessment  by  the  Finite  Element  Method 
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Figure  2:  Proposed  Research  Project 
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ig.  3:  ERROR  ANALYSIS  PROCESS  USING  FASTQ  AND  ERRFEM  CODES 
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Fig.  4(a)  Plate  with  a  Hole. 
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Fig.  4(b)  Cantilever  Beam  Under  Bending. 
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Fig.  6(a)  First  Remesh,  LSF  Method  (29  Elements) 
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Fig.  6(c)  Adaptive  Accuracy  Ratio  Distribution 
LSF  Method. 
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Fig.  7(a)  Third  (Final)  Remesh,  LSF  Method  (72  Elements) 
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Fig.  7(c)  Adaptive  Accuracy  Ratio  Distribution 
LSF  Method. 
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LSF  Method. 
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Fig.  8(a)  Adaptive  Accuracy  Ratio  Distribution. 
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Fig.  8(b)  Error  Ratio  Distribution. 
SE  Method. 
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First  Remesh,  SE  Method  (45  Elements) 
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Fig.  10(a)  Second  Remesh,  SE  Method  (81  Elements) 


Fig.  10(b)  Normalized  von  Mises  Stress  Distribution 
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Fig.  11(b):  Convergence  Comparison  of  Nonadaptive  vs. 
Adaptive  Accuracy 


2-55 


mvA\ 


Fig. 13(b)  Normalized  von  Mises  Stress  Distribution 


Min;  0.03 
Max:  0.20 


Fig. 13(c)  Adaptive  Accuracy  Ratio  Distribution 


Min:  0.08 
Max:  7.23 


Fig. 13(d)  Error  Ratio  Distribution 


Fig. 14(a)  Final  (Third)  Remesh,  SE  Method  (168  Elements) 
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Fig. 14(b)  Normalized  von  Mises  Stress  Distribution 
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Fig. 14(c)  Adaptive  Accuracy  Ratio  Distribution 
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Fig. 14(d)  Error  Ratio  Distribution 
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Fig.  15:  Convergence  of  the  Error  for  the  Beam  Example 
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Abstract:  This  report  presents  some  measurements  of  HF  received 
signal  phase  characteristics.  Phase  distortion  across  a  large 
aperture  array  is  examined  and  origins  and  initial  experimental 
measures  of  these  phase  errors  are  examined.  The  effects  of 
propagation  mode  separation  are  detailed  and  an  experimental 
approach  providing  a  statistical  description  of  phase  distortion 
is  formulated.  Experimental  measurements  were  made  using  both  E-W 
and  N-S  linear  array  baselines  for  a  one-hop,  2400  km.,  near 
geomagnetic  north-south  path  between  Ava,  NY  and  Boca  Raton,  FL. 
Initial  unmodulated  CW  characterization  is  described  and  test 
results  summarized.  A  second  set  of  experiments  involving  a  127 
length  PN  code,  was  used  to  study  mode  separation.  An  extensive 
bibliography  highlighting  past  and  current  research  in  ionospheric 
path  characterization  is  presented. 


1.0  Introduction 


For  an  ideal  phased  array  radar,  angular  resolution  is 
directly  proportional  to  receiver  array  aperture  [1]  and  the  phase 
coherence  across  the  aperture.  Unlike  Line-Of-Sight  (LOS) 
systems,  Over-The-Horizon  (OTH)  HF  radars  [2]  rely  on  refraction 
through  the  ionosphere  which  can  bring  an  attendant  degradation  of 
phase  coherence.  As  apertures  are  increased,  a  "coherence- 
limited"  phenomenon  is  observed  which  limits  angular  resolution 
and  causes  beamshape  spoilage  [3] .  This  phase  distortion  is 
quantified  by  root -mean-square  (rms)  phase  error  and  correlation 
length,  i.e.  length  where  spatial  cross  correlation  coefficient 
reduces  by  1/e  [4] .  Although  research  in  this  area  is  not 
extensive,  some  results  indicate  correlation  lengths  on  the  order 
of  1  kilometer,  corresponding  to  10-100  wavelengths  in  the  3-30 
MHz  HF  band  [5] .  Since  larger  apertures  are  desirable  for 
improved  angular  resolution,  additional  knowledge  of  phase 
coherence  across  wide-aperture  HF  phased  arrays  is  needed.  With 
this  information,  system  performance  can  be  analyzed  and  optimum 
operational  parameters  (eg.  frequency,  modulation)  can  be 
selected. 

The  determination  of  phase  error  and  correlation  length  is 
the  subject  of  this  research.  The  experimental  characterization 
required  special  signal  processing  techniques  to  assure  accuracy 
of  wide-aperture  phase  distortion  characteristics.  A  statistical 
description  is  selected  over  a  theoretical  or  empirical  approach 
because  results  can  be  directly  entered  into  the  radar  performance 
evaluations  [6] .  The  phase  coherence  statistics  involve 
correlation  measures  of  spatial,  tenqporal,  and  for  wide  bandwidth, 
frequency  coherence.  Beamshape  spoilage  and  pointing  error 
statistics  directly  follow  from  this  phase  coherence  statistical 
description  [7] 

A  complete  description  would  require  analysis  of  relevant 
ionospheric  parameters  including  hourly  solar  flux/geomagnetic 
activity  vs.  time-of-day,  day-of-month,  month-of-year,  and  year- 
of-1 1-year  solar  cycle.  These  effects  combine  and  can  be  viewed  as 
a  continuously  changing,  distorted  wavefront  impinging  on  the 
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wide-aperture  array  [8] . 


The  Icey  obstacle  to  accurate  experimental  characterization 
lies  in  ionospheric  propagation  mode  separation  [9] .  Although  a 
single,  linearly  polarized  wavefront  may  be  launched  from  the 
transmit  site,  the  signal  arriving  at  the  receive  site  will 
contain  components  of  orthogonal  polarization  [10],  differing  path 
lengths  [10],  Doppler  shifts  [11],  and  amplitude  fading  [12]. 

Since  the  focus  of  this  research  is  phase  coherence,  path 
length  separation  is  addressed.  This  necessitates  employing 
special  mode  separation  signal  processing  techniques  and 
constitutes  the  challenge  in  this  investigation.  The  other 
identified  phenomena  are  not  encompassed.  Path  length  separation 
signal  processing  techniques  are  similar  to  ranging  techniques 
utilized  in  radar  and  navigation  application  [13]  .  Innovative 
signal  processing  techniques  were  required  to  enhance  mode- 
separation  accuracy. 

%  * 

2 . 0  Background 

Many  years  of  extensive  research  have  provided  much  knowledge 
of  both  ionospheric  propagation  and  the  underlying  physical 
processes  responsible  for  these  characteristics  [14],  [15]. 
Ionospheric  propagation  amplitude  characteristics  have  received 
the  bulk  of  the  emphasis  since  point-to-point  AM  communications 
have  been  the  primary  application.  Phase  characteristics  have 
received  less  attention  due  to  their  limited  applications,  viz. 

OTH  radar.  These  phase  characteristics  are  quantified  by  phase 
error  and  spatial  correlation  length.  Phase  error  is  the  phase 
difference  between  a  signal  that  propagates  through  the  ionosphere 
and  a  signal  that  propagates  through  an  equi-distant  ideal 
transmission  medium.  In  general  for  an  N  element  array,  these 
phase  errors  relative  to  some  ideal  reference  are  of  the  form 
shown  in  Figure  1.1.  In  the  absence  of  this  ideal  reference,  the 
signal  from  element  #1  can  be  used  as  reference.  In  this  case. 
Figure  1.2  depicts  this  more-readily  measurable  phase  differences 
between  progressively  separated  array  elements.  Correlation  length 
is  determined  from  the  cross  correlation  function  of  these  phase 
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error  results.  It  is  noted  that  phase  difference  in  Figure  1.2 
varies  from  zero  to  twice  the  phase  error  of  Figure  1.1  as  the 
element  separation  increases  from  zero  to  lengths  much  greater 
than  the  correlation  length,  i.e.  when  the  two  elements  become 
uncorrelated. 

1.1  Origin  of  Phase  Error 

The  ionosphere  consists  of  stratified  layers  (75-400km.)  of 
ions  (plasma)  which  are  generated  by  solar  radiation  and 
subsequently  influenced  by  the  geomagnetic  field.  Refractive 
effects  of  these  ionic  layers  cause  HF  electromagnetic  ray  path 
bending.  The  predominate  layer  is  the  high-altitude  (350km.) 
F-layer  (night)  which  separates  during  daytime  into  two  layers,  FI 
and  F2  [16].  An  intermediate-altitude  (100km.)  E-layer  also 
provides  additional  propagation  paths.  During  daytime,  a  low 
altitude  (75km.)  D-layer  is  produced  which  attenuates  lower 
frequency  waves  (^10  MHz) .  This  layer  rapidly  dissipates  after 
sunset. 


Solar  flux  (radiation)  affects  the  number  of  ions  created, 
while  terrestrial  geomagnetic  activity  affects  the  disturbances 
and  motions  of  these  ions.  Solar  radiation  is  related  to  sunspot 
number.  It  is  characterized  by  daily  radio  solar  flux  measurements 
at  Ottawa,  Canada  (2695  MHz) .  Geomagnetic  activity  is  quantified 
by  K  and  A  indices  and  is  monitored  at  multiple  observatories 
around  the  world.  Broadcasts  of  solar  flux  and  geomagnetic 
activity  are  available  on  5,  10,  15,  and  20  MHz  from  WWV  in 
Boulder,  Colorado.  This  information  is  also  published 
periodically.  Any  Sudden  Ionospheric  Disturbances  (SID's)  are 
irdicated  by  an  associated  sun-earth  8  minute  radiation  transit 
time  and  40  hour  particle  transit  time.  Message  information  is 
also  available  from  the  Space  Environment  Laboratory. 

Since  ionospheric  propagation  is  so  dependent  on  solar 
activity,  there  are  cyclical  effects  related  to  time-of-day 
(diurnal  effects),  day-of-month  (27  day  solar  rotation  period), 
nonth-of-year  (  seasonal  effects),  and  year-of-ll-year  solar  cycle 
(solar  phenomenon) .  Since  the  latest  Cycle  22  peaked  in  1990,  this 
research  encompasses  a  snapshot  immediately  following  this  peak. 
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1.2  Ionospheric  Status 


Ionospheric  propagation  frequency  characteristics  are 
measured  at  numerous  sites  around  the  world  [17] .  Bottom-side 
sounding  (ionograms,  sweep  the  ionosphere  through  the  HF  band  and 
measure  characteristics  such  as  critical  frequency,  Maximum- 
Usable-Frequency  (MUF)  and  Lowest-Usable-Frequency  (LUF) ,  from 
which  the  Optimum-WorJcing-Frequency  (OWF)  or  Optimum  Traffic 
Frequency  (OTF)  can  be  derived  [18]  .  Top-side  sounding  had  been 
conducted  from  satellites  [19]  but  no  active  program  currently 
exists.  These  experimental  results  have  aided  in  the  development 
of  ionospheric  frequency  propagation  models  [20]  . 

Ionospheric  physical  composition  has  also  been  researched 
extensively  yielding  three  distinctive  regions:  polar,  mid¬ 
latitude,  and  equatorial  [21] .  The  polar  regions  exhibit 
distinctive  characteristics  due  to  the  accumulation  of  solar 
particles  around  the  poles,  while  the  equatorial  region  with  its 
abundant  solar  radiation  exhibits  distinctive  characteristics,  eg. 
enhanced  propagation  along  trans-equatorial  (TE)  north-south 
paths.  Also,  gray-line  propagation  along  the  Earth's  terminator 
exhibits  distinctive  characteristics  due  to  the  transient  behavior 
of  ionic  concentrations  in  the  upper  and  lower  layers.  The  gray¬ 
line  is  the  band  of  twilight  that  separates  the  white  (day)  and 
black  (night)  regions  of  the  Earth  [22). 

1.3  Multipath 

Direction  of  propagation  relative  to  the  geomagnetic  field 
(Bo)  gives  rise  to  differing  propagation  characteristics.  Faraday 
rotation  is  the  rotation  of  polarization  due  to  the  Lorentz  force 
acting  on  electrons  as  a  wave  propagates  in  the  presence  of  Bq, 
i.e.  q(v  x  Bq)  [23].  For  propagation  parallel  to  Bq,  a  linei.  ..-y 
polarized  wave  can  be  decc  posed  into  a  right-hand  circular  (RHC) 
and  left-hand  circular  (LHC)  components.  Each  experiences 
differing  phase  velocity  and  attenuation  parameters  resulting  in  a 
net  linear  polarization  rotation. 


For  propagation  perpendicular  to  Bq,  a  linearly  polarized 


wave  consists  of  a  component  parallel  to  Bq  (ordinary  component 
and  unaffected  by  Bq)  and  a  component  perpendicular  to  Bq 
(extraordinary  component  and  rotated  by  Bq) .  A  given  ray  path  will 
experience  a  complicated  combination  of  these  effects;  the 
resultant  effect  is  a  signal  separation  at  the  receive  site.  The 
IEEE  Standard  Definition  of  Terms  for  Radio-Wave  Propagation 
states  the  ordinary  wave  component  'deviates  the  least  relative  to 
a  wave  in  the  absence  of  Bq",  while  the  extraordinary  wave 
component  has  its  "  electric  vector  in  the  opposite  sense  to  that 
of  the  ordinary  wave  component"  [24] . 

1 . 4  Current  Research 

This  research  investigates  propagation  along  a  North-South 
geomagnetic  field-aligned  path.  Related  phase  characteristic  work 
has  been  done  in  the  area  of  radio  astronomy  along  very  long 
baselines  [25] .  Scintillation  effects  are  similar  to  what  is 
encountered  in  this  investigation,  but  important  differences  do 
exist.  This  research  addresses  oblique  (near-tangential)  as 
opposed  to  normal  (near-perpendicular)  incidence  and  a 
deterministic  as  opposed  to  noise-like  signal  characteristics. 

Steinberg  [26]  investigates  methods  of  increasing  HF  radar 
azimuthal  resolution.  Crane  [27]  reports  theory  and  experimental 
results  of  ionospheric  scintillation  effects  on  angle-of-arrival 
and  Doppler  fluctuations.  Humphrey  [28]  examines  multi-path  phase 
fluctuation  over  a  Panama-New  York  one-hop  path.  Kieburtz  [29] 
analyzes  angle-of-arrival  measurements  by  phase  differences  in  a 
two-element  interferometer.  Barrick  [30]  reports  techniques  and 
results  of  wide-aperture  phase  distortion  measurements  between 
California-Arkansas . 

Signal  processing  research  related  to  ionospheric 
characterization  is  quite  sparse.  Some  work  in  support  of 
frequency  characterization  (sounding)  has  been  reported  by  Coll, 
et.al.  [31].  Although  FM/CW  is  a  popular  ranging  technique,  other 
modulations  can  provide  improvements  in  characterization  accuracy. 
Application  of  spread-spectrum  Pseudo-Noise  (PN)  and  complementary 
code  (CC)  signal  processing  techniques  offers  great  promise  for 


3-9 


expanding  the  body  of  knowledge  of  ionospheric  propagation 
[32-33] . 

2.0  CW  Characterization 

The  basic  aim  of  the  CW  characterization  is  to  receive  an 
ionospherically  refracted  signal  at  multiple  elements  of  two 
perpendicular  linear  arrays  and  deduce  izimuth  and  elevation 
angle-of-arrival  from  these  measurements.  The  mean  of  the  angle- 
of-arrival  represents  the  direction  of  the  incoming  wavefront  and 
variations  of  angle-of-arrival  represents  ionospherically-induced 
distortion  or  wander.  Since  long  baselines  and  multi-wavelength 
element  spacings  are  desired,  angle-of-arrival  ambiguities  due  to 
grating  lobes  must  be  resolved  by  using  techniques  such  as 
multiple-frequencies,  multiple-spacings,  etc.  In  this  context,  an 
ambiguity  number  n  denotes  the  integral  number  of  wavelengths  in 
some  electrical  length  geometric  parameter. 

Due  to  its  simplicity,  unmodulated  CW  tests  were  selected  for 
the  initial  characterization.  Since  CW  does  not  incorporate  any 
mode  separation  signal  processing,  the  higher  end  of  the  HF  band 
(>  10  MHz)  was  selected  since  ordinary-extraordinary  wave  mode 
separation  is  greatly  reduced  in  this  frequency  range. 

A  one-hop  F-layer  near  field-aligned  path  exists  nearly 
always  in  the  daytime  between  Ava,  NY  and  FAU  at  Boca  Raton  FL. 
This  2400  km.  path  has  its  midpoint  over  North  Carolina  and  the 
ionospheric  interaction  is  confined  to  a  length  of  approximately 
100  km.  Within  this  F-layer  one-hop  path,  typical  path  length 
differences  are  expected  to  be  on  the  order  of  tens  of 
microseconds.  Multipath  involving  other  layers,  path  length 
differences  are  expected  to  be  on  the  order  of  hundreds  of 
microseconds.  The  deviation  from  exact  geomagnetic  field  alignment 
is  10°,  i.e.  the  difference  between  the  FAU  2.5°  and  Ava,  NY 
12.5°  magnetic  declinations.  A  site's  magnetic  declination  is  the 
deviation  between  magnetic  North  and  true  North.  Relative  to  FAU, 
map  data  indicate  angle-of-arrival  to  Ava,  NY  is  12.5°  azimuth 
(NE)  and  6°-12°  elevation  (corresponding  to  virtual  heights  of 
200-400  km. )  . 
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2 . 1  Experimental  Set-up 


E-W  and  N-S  baselines  were  laid  out  as  shown  in  Figure  2.1. 
Element  locations  are  labeled  as  indicated.  Existent  surveyed 
boundaries  such  as  roads  and  runways  were  used  for  coarse 
positioning  (±10  cm.) .  Although  the  N-S  baseline  could  not  be 
visually  sighted  for  straightness,  the  E-W  baseline  was  in  open 
field  and  binocular  sighting  of  straightness  yielded  a  ±2  cm.  fine 
position  adjustment.  Figure  2.2  depicts  the  E-W  and  N-S  baseline 
system  configurations . 

The  antenna  elements  selected  were  3  m. verticals  which  are 
approximately  quarter  wavelength  at  20  MHz.  These  were  constructed 
with  1.9  cm.  (3/4  in.)  aluminum  conduit  threaded  on  both  ends. 

The  lower  end  is  screwed  into  a  dielectric  (PVC)  coupling  and 
a  0.75  m.  ground  rod.  When  10  MHz  operation  is  desired,  a  second 
3  m.  section  is  added  providing  an  approximate  quarter  wavelength 
at  10  MHz.  This  ground  system  provided  satisfactory  results  due 
to  the  high  ground  conductivity  in  this  geographic  area  and  the 
receive-only  test  program.  Receive-only  minimizes  the  importance 
of  antenna  efficiency  since,  in  the  HF  band,  atmospheric  noise 
dominates  receiver  thermal  noise  and  receive  signal-to-noise  ratio 
is  not  degraded  by  low  antenna  efficiency  or  receiver  noise 
figure.  Also,  the  simplicity  of  this  ground  system  enabled  quiclc 
assembly  and  disassembly. 

The  transmission  lines  were  Government-furnished  RG-214  in 
145  m.  spools.  HF  networlc  analyzer  electrical  phase  length 
measurements  were  performed  and  lengths  trimmed  to  within  ±0.5°  at 
20  MHz.  Although  exact  electrical  length  equalization  is  desired, 
small  deviations  due  to  cable  spooling,  unspooling,  flexing,  etc. 
are  calibrated  out  by  a  "central  antenna"  calibration  procedure 
described  below. 

The  receiver/controller  equipment  consisted  of  two 
Government-furnished  Racal  R-2174  HF  receivers,  Hewlett -PacJcard 
35A  Phase  Meter,  TeJctronix  7D20  Programmable  Digitizer,  Te)ctronix 
2214  Digital  Oscilloscope,  and  a  Macintosh  computer  for  data 
storage.  In  addition,  an  RF  transfer  switching  function  is 
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Figure  2. 1  East-West  and  North-South  Baselines 
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Figure  2.2a  East-West  Baseline  System  Configuration 
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TL;RG-214  ( 1 45  m.  Section) 

SG:  Signal  Generator  (Calibration) 
Rx;  Two  -  Racal  R-2174 
Phase  Meter:  HP  3575A 
Digtal  Scope:  Tektronix  7D20 
Computer:  Mac  Plus  (45M  Storage) 


Figure  2.2b  North-South  Baseline  System  Configuration 
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necessary  for  the  calibration  procedure. 


2.2  Calibration  Procedure 

A  "central  antenna"  calibration  technique  was  used  to 
calibrate  out  any  2-element  phase  difference  error  components,  eg. 
ground  conductivity  variations,  transmission  line  electrical 
length  deviations,  etc.  The  local  calibration  signal  power 
(milliwatts)  is  set  to  simulate  the  received  signal  power  of 
ionospheric  measurements  as  indicated  on  the  receiver  RF  signal 
strength  bargraph  indicator.  The  basic  concept  is  to  transmit  the 
calibration  signal  from  a  geometrically-centered  element  and 
measure  the  phase  difference  between  two  symmetrically  spaced 
receive  elements,  eg.  Phase (A)  -  Phase(B).  If  the  measurement  is 
repeated  with  the  two  antenna/transmission  line  paths  interchanged 
(via  RF  DPDT  transfer  switch),  a  second  phase  difference  can  be 
measured, eg.  Phase  (B)  -  Phase (A).  Hence,  the  mean  of  these  two 
measurements  represents  the  zero  phase  difference  reference 
associated  with  cable  length  or  ground  conductivity  differences  in 
the  actual  ionospheric  measurements.  Figure  2.3  is  a  typical 
calibration  plot.  Since  a  commercially-available  transfer  switch 
had  not  been  procured  during  these  measurements,  simple  connection 
interchange  was  used  which  tooJc  approximately  20  seconds  to 
perform.  Hence,  Figure  2.3  indicates  a  20  second  dwell  in  one 
configuration,  a  20  second  interchange  period  (meaningless  data) , 
another  20  second  dwell  in  the  interchanged  configuration,  another 
20  interchange  period  (meaningless  data),  and,  finally,  another  20 
second  dwell  in  the  original  configuration.  Hence,  the  phase 
difference  zero  reference  is  the  mean  of  the  two  outer  regions  and 
the  inner  region.  Since  this  is  not  automated  into  the 
measurement,  this  phase  difference  zero  reference  is  determined 
manually  from  the  data. 

2.3  CW  E-w  Baseline  Ionospheric  Measurements 

Initial  Ava,  NY-FAU  measurements  used  unmodulated  CW  at 
10.205  and  20.861  MHz  with  a  400  Hz  receiver  bandwidth.  No  mode 
separation  techniques  were  included  with  these  measurements. 

Since  these  were  daytime  measurements  and  path  attenuation  reduces 
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E-W  FIELD  SITE  -  10FT  VERTICALS 

20.861  MHz,  CW,  E2/W2  SEPARATION  =  580  m  CALIBRATION 
Date:  08-16-1990  Time:  22:21:56  Zero  Phase:  0  Slope: 
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Figure  2.3 


as  frequency  increasesr  the  20.861  MHz  signal  was  much  stronger 
than  the  10.205  MHz  and  provided  more  meaningful  results. 

Although  the  10.205  MHz  signal  was  detectable,  the  excessive 
amplitude  fading  did  not  allow  any  meaningful  phase  difference 
measurements  which  was  an  indication  of  multipath/multihop. 

2.3.1  El/Wl  (Separation  »  290  m.) 

Figure  2.4  is  an  ionospheric  measurement  and  Figure  2.5  is 
the  associated  calibration  data.  In  Figure  2.5,  the  0-15  second 
and  85-100  second  periods  represent  Phase (Wl)  -  Phase (El) 
calibration  signal  and  the  41-68  second  period  represents 
Phase (El)  -  Phase  (Wl)  calibration  signal.  Again,  the  phase 

difference  zero  reference  is  the  mean  of  this  data  (57.5°).  Some 
background  noise  is  superimposed  on  these  levels  and  would  also  be 
present  on  ionospheric  data.  It  is  noted  the  vertical  scale 
differences  between  ionospheric  data  and  calibration  data.  Figure 
2.6  represents  a  histogram  of  the  Figure  2.4  data.  Figure  2.7 
illustrates  the  geometry  of  the  configuration,  the  angJ.e-of- 
arrival  ambiguities,  and  ambiguity  resolution  based  on  standard 
map  measurements.  Again,  the  ambiguity  number  n  is  the  integral 
number  of  electrical  wavelengths  included  in  the  geometric  length 
LI  . 

2.3.2  E2/W2  (Separation  =  580  m.) 

Figures  2.8-2.10  represent  similar  plots  as  those  described 
above  except  for  E2/W2  measurements.  Figure  2.11  depicts  the 
geometry,  angle-of-arrival  ambiguities,  and  comparison  with  known 
direction-of-arrival . 

2.3.3  W1/E2  (Separation  =  435  m.) 

Due  to  lack  of  symmetry  of  two  elements  relative  to  the 
central  calibration  element,  the  "central  antenna"  calibration 
technique  cannot  be  used  for  obtaining  a  phase  difference  zero 
reference  and,  hence,  mean  angle-of-arrival.  Nevertheless,  the 
time-varying  component  of  phase  difference  is  valid  and  is  shown 
in  Figures  2.12  and  2.13. 

2.3.4  W2/E1  (Separation  -  435  m.) 
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20.861  MHz,  CW,  E1/W1  SEPARATION  =  290  m  DATA  RUN 

Date:  08-16-1990  Time:  21:15:37  Zero  Phase:  57.5  Slope:  2.1157 
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Time  (Sec) 

Resolution  (Sec):  .1  Data  File:  aug16.11 

Figure  2.4 


E-W  FIELD  SITE  -  10FT  VERTICALS 

20.861  MHz,  CW,  E1/W1  SEPARATION  =  290  m  DATA  RUN 

Date:  08-16-1990  Time:  21:15:37  Zero  Phase:  57.5  Slope:  2.1157 
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Delta  Phase  (Deg) 

Resolution  (Sec):  .1  Bin  Width  (Deg):  5.0  Data  File:  aug16.11 

Figure  2.6 


X=c/(20.861  MHZ)  =  14.38m. 

Mean[Phase(  W 1 )  -  Phase(E  1  )l=+238.5“=-l  2 1 .5®  (measured) 
Phase(W  1 )  -  Phase(E  1 )  =  -|3(L  I ) 

-  n(2TT)-  121.5®(tt/180®)=  -(2tt/ 1  4.38m. )L  1 
L I  =  4.85m.  n(  1 4.38m.) 

Az  =  arcsindl  /290m.) 
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Figure  2.7a  Angle-of-Arnval  Azimuth  Calculations 
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Notes; 

1.  Map  measurements  indicate 
Ava.NY  -  FAU  «  12.5«. 

2.  Since  360® phase  shift  corresponds  to  3‘ 
(or  52.36  mrads)  of  angle-of-arrival,  i® 
phase  shift  corresponds  to  .  1 5  mrads. 


Figure  2.7b  Angle-of-Arrival  Azimuth  Ambiguities 
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E-W  FIELD  SITE  -  10FT  VERTICALS 

20.061  MHz,  CW,  E2/W2  SEPARATION  =  580  m  DATA  RUN 

Date:  08-16-1990  Time:  20:16:19  Zero  Phase:  195  Slope:  2.1157 
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Resolution  (Sec):  .1  Data  File:  aug16.04 

Figure  2.8 


E-W  FIELD  SITE  -  10FT  VERTICALS 

20.061  MHz,  CW,  E2/W2  SEPARATION  =  580  m  CALIBRATION 
Date:  08-16-1990  Time:  20:08:27  Zero  Phase:  0  Slope: 
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Time  (Sec) 

Resolution  (Sec):  .1  Data  File:  aug16.03 

Figure  2.9 


E-W  FIELD  SITE  -  10FT  VERTICALS 
20.061  MHz,  CW,  E2/W2  SEPARATION  =  580  m  DATA  RUN 
Date:  08-16-1990  Time:  20:16:19  Zero  Phase:  195  Slope:  2.1157 
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Resolution  (Sec):. 1  Bin  Width  (Deg):  5.0  Data  File:  aug16.04 

Figure  2.10 
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X=C/(20.86I  MHz)  =  14.38m. 

Mean[  Phase(W2)  -  Phase(E2)  ]  =  -142®  (measured) 
Phase(W2)  -  Phase(E2)  =  -p(L2) 

-  n(2TT)-  142®(TT/180®)=-(2TT/l4.38m.)L2 
L2  =  5.6  1  m.  +  n(  1  4.38m.) 

Az  =  arcsin(L2/580m.) 
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Figure  2. 1  1  a  Angie-of-  Arrival  Azimuth  Calculations 
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1.  Map  measurements  indicate 
Ava.NY  -  FAU  «  12.5°. 

2.  Since  360°phase  shift  corresponds  to  i  5‘ 
(or  26.18  mrads)  of  angle-of-arrival,  1° 
phase  shift  corresponds  to  .07  mrads. 


Figure  2.1  lb  Angle-of-Arrival  Azimuth  Ambiguities 
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E-W  FIELD  SITE  -  10FT  VERTICALS 

20.861  MHz,  CW,  E2/W1  SEPARATION  =  435  m  DATA  RUN 

Date:  08-16-1990  Time:  22:46:39  Zero  Phase:  185  Slope:  2.1157 
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Time  (Sec) 

Resolution  (Sec):  .1  Data  File:  aug16.21 

Figure  2.12 
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20.861  MHz,  CW,  E2/W1  SEPARATION  =  435  m  DATA  RUN 
Date:  08-16-1990  Time:  22:46:39  Zero  Phase:  185  Slope: 
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Delta  Phase  (Deg) 

Resolution  (Sec):  .1  Bin  Width  (Deg):  5.0  Data  File:  aug16.21 


1 


Again,  the  "central  antenna"  calibration  technique  cannot  be  t 

used  and  only  the  time-varying  component  of  phase  difference  is 
valid  and  shown  in  Figures  2.14  and  2.15.  ^ 

2.3.5  Summary 

(Representative  Data  With  Amplitude  Within  Receiver  Dynamic  Range) 


StdOev 

StdDev 

Mean 

Configuration 

[Phase  Difference! 

[Az imuth] 

[Azimuth] 

(degrees) 

(mrads) 

(degrees) 

Wl/El 

(290 

m. ) 

13.0 

1.95 

12.4 

W1/E2 

(435 

m. ) 

16.6 

1.87 

- 

W2/E1 

(435 

m. ) 

41.4 

4.66 

- 

W2/E2 

(580 

m. ) 

14.7 

1.10 

12.0 

W2/E2 

(580 

m. ) 

22.1 

1.65 

12.1 

W2/E2 

(580 

m. ) 

50.6 

3.79 

12.6 

Although  the  angle-of-arrival  data  is  reasonable  when 
compared  to  the  known  direction-of-arrival,  standard  deviation  of 
phase  difference  does  not  indicate  a  trend  as  aperture  width 
increases.  This  could  be  due  to  an  insufficient  number  of  samples 
or  the  lack  of  any  mode  separation  techniques  in  the  unmodulated 
CW  transmissions.  Hence,  further  measurements  must  incorporate 
these  mode  separation  techniques . 

2.4  N-S  Baseline  Ionospheric  Measurements 

Again,  since  the  20.861  MHz  signals  were  much  stronger  than 
the  10.205  MHz  signals,  only  20.861  MHz  results  are  reported. 

2.4.1  Sl/Nl  (Separation  -  290  m.) 

Figures  2.16-2.18  are  these  measurement  results  and  Figure 
2.19  depicts  the  associated  geometry  and  calculations. 
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E-W  FIELD  SITE  -  10FT  VERTICALS 

20.861  MHz,  CW,  E1/W2  SEPARATION  =  435  m  DATA  RUN 

Date:  08-16-1990  Time:  21:55:42  Zero  Phase:  109  Slope:  2.1157 
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Resolution  (Sec):  .1  Data  File:  aug16.16 

Figure  2.14 


N-S  FIELD  SITE-10  FT  VERTICALS 

20.061  MHz,  CW,  N1/S1  SEPARATION  =  290  m  DATA  RUN 

Date:  08-24-1990  Time:  20:45:57  Zero  Phase:  104  Slope:  2.1157 
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Resolution  (Sec):  .1  Data  File:  aug24.04 

Figure  2.16 


N-S  FIELD  SITE-10  FT  VERTICALS 

20.061  MHz,  CW,  N1/S1  SEPARATION  =  290  m  DATA  RUN 

Date:  08-24-1990  Time:  20:45:57  Zero  Phase:  104  Slope:  2.1157 
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Resolution  (Sec):  .1  Bin  Width  (Deg):  5.0  Data  File:  aug24.04 

Figure  2.18 


Zenith 


X=c/(20.86l  MHz)  =  U.38m. 

Meant  Phase(S  I )  -  Phase(N  1 )  ]  =  -227®  (measured) 
Phase(S  1 )  -  Phase(N  1 )  =  -p(L  1 ) 

-  n(2TT) -  227® (TT/  1 80®)  =  -(2tt/  1  4.38m. )L  I 
L I  =  9.06m.  +  n(  1 4.38m.) 

EL  =  arccosd  I  /290m.) 

Note:  Since  LI  <;  290m.  ,  r\  i  19. 


Figure  2. 1 9a  Angle-of-Arrival  Elevation  Calculations 
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nqUs; 

1.  For  the  Ava,NY-FAU  ground  range  of  2400km.,  Griffiths 
[33]  indicates  elevation  angle-of-arrival  ranges  6°-12® 
for  effective  heights  of  200-400km.,  respectively. 

2.  Since  360®  phase  shift  corresponds  to  9.2®  (or  1 60.5  mrads) 
of  angle-of-arrival,  I® phase  shift  corresponds  to 

.45  mrads. 


Figure  2. 1 9b  Angle-of-Arnval  Elevation  Ambiguities 
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2.4.2  S2/N2  (Separation  =  580  tn.) 


Figures  2.20-2.22  are  these  measurement  results  and  Figure 
2.23  depicts  the  associated  geometry  and  calculations 

2.4.3  Summary 

(Representative  Data  With  Amplitude  Within  Receiver  Dynamic  Range) 


Std  Dev 

Std  Dev 

Mean 

Configuration 

[Phase  Difference] 

[Elevation] 

[Elevation] 

(degrees) 

(mrads) 

(degrees) 

Sl/Nl  (290  m.) 

8.7 

4.6 

13.3 

S2/N2  (580  m.) 

26.7 

8.0 

14.5 

The  standard  deviation  and  mean  of  elevation  follow  expected 
results.  When  aperture  was  doubled,  standard  deviation  of  phase 
difference  more  than  doubled  and  standard  deviation  of  elevation 
degraded  from  constancy.  The  mean  of  elevation  results'  agree 
closely  with  published  models  [34],  i.e.  60-12°  for  virtual 
heights  of  200-400  )cm. 

3.0  Mode  Separation  Experiments 

An  analysis  of  the  CW  data  indicate  the  presence  of  sudden 
phase  jumps.  In  order  to  study  the  origin  of  these  phase  jumps,  a 
novel  mode  separation  technique  [33],  was  employed. 

The  transmitter  signal  was  phase  modulated  with  a  127  bit 
length  Pseudo  Random  (PN)  code  sequence  (i.e.  length  7  shift 
register  with  1,7  feedback  connections).  The  chip  rate  of  the 
transmitted  code  was  10  KHz.  At  the  receive  site,  two  receivers 
were  phase  locked  to  a  common  10  MHz  reference  signal.  Normally, 
one  would  mix  the  received  signal  down  to  baseband  and  then 
correlate  the  baseband  signal  with  the  transmitted  code  sequence. 
However  a  technique  based  on  direct  IF  sampling  was  used.  The  IF 
is  directly  undersampled  and  the  conversion  to  baseband  is  done 
via  software.  This  method  eliminates  the  need  for  two  balanced  A/D 
converters  needed  in  the  normal  process  of  conversion  to  baseband. 
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ricL-LT  oiic-iu  ri  vcniiOMuo 
20.861  MHz,  CW,  N2/S2  SEPARATION  =  580  m  DATA  RUN 
Date:  08-24-1990  Time:  21:46:41  Zero  Phase:  120  Slope:  2.1157 
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Resolution  (Sec):  .1  Data  File:  aug24.09 


N-S  FIELD  SITE-10  FT  VERTICALS 
20.861  Mhz,  CW,  N2/S2  SEPARATION  =  580  m  CALIBRATION 
Date:  08-24-1990  Time:  21:51:12  Zero  Phase:  0  Slope: 


Time  (Sec) 

Resolution  (Sec):  .1  Data  File:  aug24.10 

Figure  2.21 


580  m. 


X  =  C/(20.86I  MHz)  =  1438m. 


Mean[  Phase(S2)  -  Phase(N2)  ]  =  -60.4®  (measured) 
Phase(S2)  -  Phase(N2)  =  -p(L2) 

-  n(2TT)-60.4®  (11/180®)=  -(211/ 1 4.38m. )L1 
L2  =  15.1m.  >  n(14.38m.) 

EL  =  arccos(L2/580m.) 


Note.  Since  L2  i  580m.  ,  n  i  39. 
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Figure  2.23a  Angle-of-Arrtval  Elevation  Calculations 
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Notes: 

1.  For  the  Ava,NY-FAU  ground  range  of  2400km.,  Griffiths 
[33]  indicates  elevation  angle-of-arrival  ranges  6*-l2® 
for  effective  heights  of  200-400km,,  respectively. 

2.  Since  720®  phase  shift  corresponds  to  1 9.4®  -6.8®  =  1  2.6® 
(or  2 1 9.9  mrads)  of  angle-of-arrival,  1  ® phase  shift 
corresponds  to  .31  mrads. 


Figure  2.23b  Angle-of-Arrival  Elevation  Ambiguities 
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The  technique  was  successfully  tested  in  the  laboratory  prior  to 
the  field  measurements. 

In  the  field  experiments,  each  receiver's  455  kHz  IF  was 
sampled  at  52  KHz  via  a  digital  sampling  oscilloscope.  The  memory 
depth  of  the  oscilloscope  was  16k  per  channel  thus  permitting  two 
channels  of  ~24  sequential  code  sequences  (~300  millisec)  to  be 
captured . 

Two  algorithms  to  convert  to  baseband  were  considered.  One 
was  developed  by  C.  Rader  [35]  and  the  other  by  Waters  and  Jarrett 
at  NRL  [3  6-37]  .  In  both  laboratory  tests  and  on  field  data,  the 
NRL  algorithm  gave  lower  sidelobes  and  hence  was  used  in  the 
analysis . 

Initial  tests  were  made  in  March,  1991  at  24.5  MHz  with  a 
single,  roof  mounted  antenna.  In  August,  1991  at  15.59  MHz,  a 
field  set  of  measurements  was  made  using  the  El-Wl,  290m  antenna 
separation  shown  in  Figure  2.1.  In  these  latter  testsw  for 
calibration  purposes,  a  signal  from  one  antenna  was  split  equally 
into  each  receiver.  The  correlations  peaks  were  approximately 
equal  indicating  equivalent  processing  gain  in  each  channel. 

In  the  March  measurements,  only  a  single  correlation  peak  was 
observed  whereas  measurements  made  in  August  at  15.59  MHz  yielded 
two  or  more  peaks .  These  latter  measurements  were  made  in  the 
daytime  and  the  reports  from  Boulder  indicated  low  solar  activity 
and  geomagnetic  activity  A  =  4  and  K  <  4  indices.  Examples  of  a 
single  correlation  peak  for  1  code  length  and  stacked  24  code 
lengths  are  shown  in  Figures  3.1  and  3.2.  Note  that  over  the  24 
pulse  sequence,  no  significant  pulse  amplitude  variations  are 
observed.  The  additional  lower  amplitude  peak  to  the  right  is  due 
to  a  processing  sidelobe  introduced  by  a  small  frequency  mismatch 
between  the  receiver  IF  frequency  and  the  sampling  rate.  This 
mismatch  arose  from  the  fact  that  at  the  receive  site,  separate 
and  independent  crystal  controlled  frequency  synthesizers  were 
used  to  generate  the  receiver  455  KHz  and  the  oscilloscope  52  KHz 
sampling  rates. 
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24.5  Mhz,  CW  Mode,  Root  Antenna.  455  kilz  II-  into  I  tK  2214 
Chip  Length  (usee):  100.00  Sample  Rate  (Hz):  52000  Code  Length:  127 

Samptes/Code:  660  Stalling  Sample:  0  Skip  Sample:  0 


Figures  3.3  and  3.4  illustrate  the  twin  correlation  peaks 
observed  in  the  15.59  MHz  data.  Again,  small  sidelobes  appear  at 
the  right  of  the  main  peaks  due  to  frequency  synthesizer 
mismatches.  Note  that  at  one  antenna,  the  first  main  peak  is 
larger  than  the  second  whereas  at  the  other  antenna,  the  second  is 
larger  than  the  first.  Between  the  major  peaks,  there  is  a  clean 
separation  of  approximately  225  usee.  This  separation  can  be 
explained  as  multipath  between  the  E-layer  and  F-layers .  For  an 
E-layer  height  of  100  km  and  F-layer  height  of  250  km.,  simple 
geometry  yields  a  path  difference  of  235  usee. 

Correlation  snapshots  at  both  antennas  taken  3  minutes  later 
are  shown  in  Figures  3.5  and  3.6.  Now  at  both  antennas,  the  second 
correlation  peak  is  higher  than  the  first.  During  this  short  time 
interval  the  relative  amplitude  of  each  pulse  has  also  changed 
significantly.  If  the  phase  of  the  signal  at  the  first  antenna 
would  have  been  measured  with  the  receiver  set  for  CW,  there  would 
have  been  a  significant  phase  jump  as  the  dominant  component 
changed  from  one  peak  to  the  other. 

Figures  3.7  and  3.8  illustrate  the  signals  captured 
simi  Itaneously  at  the  two  receiving  sites  the  previous  day.  In 
this  case  they  are  very  dissimilar,  each  site  having  a  different 
number  of  correlation  peaks  thus  indicating  the  complex  nature  of 
the  multipath  over  a  short  baseline  distance  of  only  290  m. 

These  results  should  be  taken  to  represent  a  proof  of  concept 
that  direct  IF  sampling  of  PN  codes  can  be  used  to  resolve 
multipath.  The  oscilloscope  has  four  channels  and  hence  could  be 
used  to  obtain  simultaneous  data  from  four  antennas.  However,  two 
addition  received  would  have  to  be  acquired  to  perform  these 
measurements . 

From  the  sequential  24  pulse  results,  over  a  300  millisec 
time  interval,  no  significant  amplitude  changes  were  noticed. 
However,  over  a  3  minute  interval,  very  significant  amplitude 
changes  were  noticed  occasionally.  It  would  be  of  interest  to 
obtain  non-sequential  data  snapshots  spanning  the  3  minute  period. 
This  could  be  done  with  some  additional  electronic  hardware  to 
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15.59  Mhz,  CW  Mode.  OdB  Pad.  455  kHz  IF  into  TEK  2214 
Chip  Length  (usee);  100.00  Sample  Rate  (Hz):  52CX)0  Code  Length;  1 27 

Samples/Code:  660  Code  Start:  200  Data  Start;  0 


1 5.59  Mhz,  CW  Mode.  0  dB  Pad.  455  kHz  IF  into  TEK  22 1 4 
Chip  l^ngUi  (usee);  100  00  Sample  Rate  (Hz);  52000  Code  Length;  127 

Samples/Code;  660  Code  Stan.  200  Data  Start;  0 
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Data  Flic;  cl  17  52  Time:  1991:8:15:10:46  56  EDT  Data  File:  aug 1 5  03a 


15.59  Mlu.CW  Mode.OdB  Pad.  455  kll/ IF  mio  I  l£K  2214 
Chip  L^ngtii  (usee):  100.00  Sample  Rate  (Hz);  52000  Ccxle  Length;  127 

Samples/Code;  660  Code  Start;  200  Data  Start;  0 
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15.59  Mhz.  CW  Mode.OdB  Pad.  455  kllz  IP  into  Tl-K  2214 
Chip  Length  (usee);  100.00  Sample  Rate  (Hz):  52000  Code  Length;  127 

Samples/Code;  660  Code  Start;  200  Data  Start:  0 


dB  Pad.  455  kHz  IFiiiio  l  EK  2214 

i  Rate  (Hz);  52000  Code  Length;  127 


15.59  Mhz.  CW  Mode.  0  dB  Pad.  455  kHz  IF  inio  TEK  2214 
Chip  Length  (usee):  100.00  Sample  Rate  (Hz):  52000  Code  Length:  127 

Samples/Code:  660  Code  Start:  200  Data  Start:  0 


incorporate  a  burst  control  into  the  data  sampling  frequency 
synthesizer . 

4.0  Conclusions 

The  means  of  azimuth  and  elevation  angle-of-arrival  were 
measured  and  found  to  be  quite  close  to  expected  results  from  map 
information  and  published  models.  In  addition,  standard  deviation 
of  elevation  was  measured  and  yielded  qualitatively  expected 
results.  However,  the  standard  deviation  of  azimuth  measurements 
did  not  follow  the  expected  trend  with  larger  baseline’  separation. 
Relative  phase  measurements  between  two  widely  spaced  antennas 
produced  results  with  periods  of  relative  phase  stability  and 
other  periods  of  sudden,  large  phase  jumps.  To  investigate  this 
further,  mode  separation  techniques  were  incorporated  into  the 
experimental  program. 

Mode  separation  was  done  via  the  transmission  of  a  127  length 
PN  code  sequence .  A  new  technique  employed  in  these  measurements 
was  direct  IF  sampling  and  the  conversion  to  baseband  performed 
via  software.  The  455  kHz  receiver  IF  amplitude  was  digitally 
sampled  at  52  KHz  with  8  bit  resolution.  Subsequent  correlation  to 
the  original  PN  code  sequence  separated  the  signal  cleanly  into 
two  components  at  15.59  MHz  and  a  single  component  at  24.5  MHz. 

The  time  separation  of  the  two  components  at  15.59  MHz  could  be 
matched  to  multipath  from  the  E-  and  F-layers.  At  15  MHz,  the 
relative  amplitude  of  the  2  components  could  change  significantly 
over  short  (minute)  periods.  For  a  receiver  operating  in  a  CW 
mode,  a  monitoring  of  the  IF  phase  (and  hence  the  carrier  phase) 
would  experience  a  significant  jump  when  the  dominant  amplitude 
shift  from  one  component  to  the  other  component.  This  effect  is 
probably  responsible  for  the  sudden  large  phase  changes  seen  the 
in  original  CW  data.  These  jumps  were  not  directly  observed  during 
the  phase  modulated  measurements  since  these  latter  were  only  300 
millisecond  snapshots. 

It  would  be  very  helpful  in  future  measurements  to  generate 
an  ionospheric  profile  via  a  sounder.  Locating  the  altitude  of  the 
significant  ionospheric  refxection  layers  could  corroborate  the 
observed  path  delays.  Instrumental  improvements  could  include 
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incorporating  an  improved  time  standard  which  could  be  used  to 
lock  to  the  transmitting  waveforms  and  generate  coherent  receiver 
IF  and  sampling  frequencies.  Higher  chip  rate  code  sequences  could 
be  used  to  investigate  multipath  phenomena  within  a  single  layer. 
Some  wideband  probing  has  been  reported  in  the  literature  [38]  .  It 
would  also  be  of  interest  to  transmit  other  codes  sequences  such 
as  complementary  codes.  These  latter  codes  have  the  property  of 
having  zero  sldelobes. 
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2.  ABSTRACT 


Most  amorphous  and  glassy  materials  exhibit  anomalies  in  their  low 
temperature  thermal  properties.  Similar  anomalies  were  observed  in  a 
number  of  high  temperature  superconductors.  It  is  believed  that  the 
anomalies  in  glasses  arise  from  tunneling  states  in  the  solid,  however, 
there  is  as  yet  no  microscopic  model  for  these  anomalies.  In  this  work  we 
start  with  microscopic  Hamiltonian  for  a  set  of  multi-orientational 
tunneling  (elastic)  quadrupoles  which  interact  with  a  quadrupolar 
interaction.  From  this  Hamiltonian  we  obtain  the  free  energy  for  low 
concentrations  of  tunneling  units  and  derive  the  first  completely 
microscopic  theory  for  the  low  temperature  properties  of  tunneling  quadru¬ 
poles  which  are  very  similar  to  those  observed  in  glasses. 

We  show  that,  contrary  to  the  currently  accepted  theory  on  glasses, 
interactions  play  a  very  important  role  in  the  low  energy  excitations  of 
tunneling  quadrupoles.  Our  work  suggests  that  current  approaches  to  the 
theory  of  amorphous  materials  and  glasses  may  have  to  be  modified  in  order 
to  make  further  progress  in  understanding  them. 
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3.  INTRODUCTION 


Amorphous  materials  have  a  good  deal  of  practical  importance  in 
technology  and  their  understanding  presents  a  serious  challenge  to  the 
experimental  as  well  as  theoretical  physicist.  There  has  been  a  good  deal 
of  recent  interest  in  the  study  of  the  properties  of  amorphous 
materials. [1,2]  This  interest  arises  because  experiments  on  glasses  and 
amorphous  materials  exhibit  low  temperature  anomalies  in  their  specific 
heat, [3-4]  thermal  conductivity,  thermal  expansion,  dielectric  relaxation 
and  sound  propagation  and  absorption.  The  early  experiments  concentrated 
mostly  on  amorphous  insulators.  More  recent[5]  ones  found  that  the  glassy 
anomalies  are  also  present  in  amorphous  metals. 

It  is  quite  remarkable  that  almost  all  amorphous  materials  show  very 
similar  low  temperature  anomalies.  To  explain  these  anomalies  it  was 
proposed  that  the  "glassy" [6]  system  is  made  up  of  two-level  tunneling 
states  (TLS)  [3,4]  with  a  random  distribution  of  barriers  between  the 
two  wells.  An  ad-hoc  assumed  phenomenological  constant  density  of  states 
gives  many  of  the  experimentally  observed  properties  of  glasses,  and  thus 
presents  a  good  deal  of  experimental  support  for  the  existence  of 
tunneling  states  in  glasses. [1,2]  However,  all  attempts  to  obtain  a 
microscopic  description  for  the  tunneling  states  or  for  the  constant 
density  of  states  have  had  very  limited  success. 

In  addition  to  most  amorphous  materials,  there  are  a  number  of  other 
systems  thdc  exhibit  "glasslike"  properties  at  low  temperatures.  Dilute 
concentrations  of  tunneling  electric  dipoles[7-8] ,  (for  example  Li'*'  in 
KCl)  or  (elastic)  strain  dipoles[9]  (for  example  CN"  dissolved  in  KCl) 
also  show  low  temperature  glasslike  anomalies.  In  fact  there  are  whole 
classes  of  impurities  which  when  dissolved  in  alkali  halides  show  glass- 


like  properties  of  one  form  or  another. [9-14] 

More  recently  there  has  been  a  good  deal  of  evidence  in  the  litera¬ 
ture  for  glassy  properties  in  a  number  of  high  temperature  supercon¬ 
ductors.  Evidence  for  possible  glassy  properties  have  been  seen  in 
the  sound  velocity[15-19],  in  the  thermal  conductivity[20-21]  and  in  the 
specific  heat. [21]  It  is  the  observation  of  the  glassy  properties  of  some 
of  the  high  temperature  superconductors  that  has  motivated  this  research. 

In  an  attempt  to  identify  the  origin  of  the  low-energy  excitations  in 
tunneling  units,  I  studied[22]  the  microscopic  properties  of  very  dilute 
two-orientational  Ising-model  tunneling-dipoles  which  interact  via  a  1/r^ 
interaction.  I  derived  a  set  of  microscopic  relations  for  the  thermal 
properties  of  the  tunneling  dipoles.  From  the  microscopic  derivations  I 
obtained  a  constant  density  of  excitation  energies  E  for  low  E  in 
agreement  with  experimental  results  observed  in  glasses.  Later  on[23,24]  I 
treated  multiorientational  tunneling  dipoles  (for  example  Li'*’  in  KCl)  and 
obtained  the  specific  heat,  the  thermal  expansion  and  the  complex 
dielectric  susceptibil ity[24]  for  the  tunneling  dipoles.  Again  these 
results  are  consistent  with  those  observed  in  glasses.  Thus  the  tunneling 
dipole  problem[24]  serves  as  a  first  microscopic  model  for  one  particular 
glass  system. 

In  spite  of  the  successful  derivation  of  glass'* ike  behavior[24]  for 
the  tunneling  dipoles,  the  derivations  are  not  expected  to  shed  much 
understanding  on  real  glasses  which  do  not  in  general  exhibit  large  local 
dipole  moments.  They  do,  however,  show  low  energy  excitations  which  affect 
the  elastic  constants  and  the  sound  velocity.  Such  effects  are  believed  to 
be  primarily  strain  dominated  and  were  observed  in  high  T^  supercon¬ 
ductors.  [15-19] 
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4.  OBJECTIVES  OF  THE  RESEARCH 


The  objective  of  the  research  was  to  formulate  a  microscopic 
approach  to  derive  the  thermal  properties  of  dilute  interacting  tunneling 
(elastic)  strain  quadrupoles  and  to  use  this  formulation  to  obtain 
expressions  for:  (a)  the  density  of  states  (excitation  energies);  (b)  the 
specific  heat;  (c)  the  thermal  conductivity;  (d)  the  sound  absorption  and 
propagation  and  (e)  the  dielectric  properties.  A  further  objective  was  to 
study  the  glasslike  properties  of  high  Tj.  superconductors.  It  was  hoped 
that  the  understanding  of  the  glasslike  properties  of  tunneling 
quadrupoles  will  also  help  to  understand  some  of  the  glasslike  properties 
of  high  temperature  superconductot^s. 
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5.  RESEARCH  ACCOMPLISHED 


In  this  section  we  briefly  describe  the  research  accomplished 
through  the  support  of  this  grant. 

As  outlined  in  the  introduction,  there  is  currently  no  microscopic 
theory  which  explains  the  universal  properties  of  amorphous  materials  or 
the  glasslike  properties  of  high  temperature  superconductors.  With  the 
help  of  work  which  I  started  during  my  Summer  Faculty  Research  Fellowship 
at  Rome  Air  Development  Center  (RADC)  I  have  been  able  to  develop  a 
microscopic  theory  for  dilute  tunneling  quadrupoles  randomly  distributed 
in  crystalline  hosts.  This  microscopic  theory  gives  glasslike  properties 
for  (elastic)  tunneling  strain  units  in  a  solid  from  fundamental 
considerations. 

First  we  formulated  the  problem  of  interacting  (elastic)  tunneling 
quadrupoles  randomly  distributed  in  a  nonpolar  host.  We  allowed  the 
tunneling  quadrupoles  to  interact  via  a  strain  dominated  interaction.  We 
obtained  low  energy  excitations  similar  to  those  observed  in  glasses.  A 
preliminary  report  on  this  work  was  presented  at  the  American  Physical 
Society  Meeting,  March  1990,  Los  Angeles,  CA.[25]  A  copy  of  this  publica¬ 
tion  is  included  as  Appendix  A  to  this  report. 

Next  we  examined  the  low  temperature  thermal  conductivity  and  found 
this  also  to  be  very  similar  to  that  found  in  glasses.  A  preliminary 
report  on  this  work  was  presented  at  the  American  Physical  Society 
Meeting,  March  1990,  Los  Angeles,  CA.[26]  A  copy  of  this  publication  is 
included  in  Appendix  B  to  this  report. 

Next  we  derived  the  low  temperature  specific  heat  for  the  tunneling 
quadrupoles.  We  showed  that  our  temperature  dependent  specific  heat 
predictions  are  in  very  good  agreement  with  low  T  experiments[27]  on  320 
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ppm  CN'  in  KBr  and  give  a  near-neighbor  CN"-CN‘  interaction  Kg  <  100  K. 
Previous  calculations,  using  the  model  developed  for  dipoles[22]  gave  a 
value  of  Kg  much  larger  than  both,  100  K  and  the  estimated  correct 
value. [27]  This  theory  also  explains  the  low  energy  excitations  observed 
in  rotary-echo  experiments[28]  from  CN"  pairs  in  KBr.  Thus  the  aerivations 
for  the  quadrupoles  explain  the  two  experiment  so  far  available.  However 
the  theory  predicts  all  the  thermal  properties  of  the  quadrupo'its,  many  of 
which  have  not  yet  been  measured.  It  is  hoped  that  this  theory  will 
stimulate  a  series  of  new  experiments  to  compare  with  the  predictions  of 
the  theory.  Some  suggested  measurements  a'*e:  (a)  the  concentration  depen¬ 
dence  of  the  specific  heat  for  'ow  concei.trations  and  low  temperatures; 
(b)  the  thermal  co-’ducti'.'ity;  (c)  the  temperature  and  frequency  dependent 
dielectric  constant. 

Upon  deriving  ou'*  temperature-de^jerdsnt  specific  heat  results  they 
were  presented  at  the  Intei national  Conference  on  Relaxation  in  Solids  at 
Crete,  Greece,  June  1/-29,  !S'?0.  These  results  were  written  up  and  were 
accepted  fcr  publication  by  the  Journal  for  Noncrystalline  Solids  and  will 
appear  in  April  1991. [29]  A  copy  of  this  paper  is  enclosed  as  Appendix  C 
of  this  report. 

In  further  derivations  we  showed  that  there  are  important  differences 
in  our  approach  for  the  tunneling  quadrupoles  problem  and  that  of  the 
accepted  TLS  model.  These  differences  are:  in  the  TLS^'^  the  low  energy 
excitations  arise  from  isolated  TLS  with  an  assumed  random  distribution  of 
potential  barriers,  in  our  derivations  they  arise  from  the  strongly 
interacting  tunneling  quadrupoles;  in  the  TLS  model  the  tunneling  matrix 
element  A  must  be  very  small  to  contribute  at  low  T,  in  our  derivation  A 
may  be  large  for  sufficiently  strong  interactions;  in  the  TLS  model  the 
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constant  density  of  states  is  assumed  without  justification,  in  our  work 
the  constant  density  and  other  glasslike®  properties  are  derived  from 
fundamental  principles;  the  TLS  model  can  not  derive  the  microscopic 
dielectric  constant  or  determine  whether  the  phonons  couple  to  the  TLS  via 
the  tunneling  matrix  element  A  or  the  anisotropy  energy,  while  in  our  work 
these  quantities  are  directly  obtained  fro...  a  microscopic  derivation.  Our 
work  thus  gives  the  first  completely  microscopic  model  for  the  glassy 
state  of  tunneling  quadrupoles  without  using  the  mean  field  approximation. 
Mean  field  approximation  gave  microscopic  models  for  higher  concentrations 
than  the  ones  treated  here. [30,31] 

We  also  showed  that  there  are  significant  differences  between  the 
results  obtained  for  the  tunneling  dipoles[24]  and  that  for  the  tunneling 
quadrupoles  derived  here.  For  the  Z^-orientational  tunneling  dipoles  the 
partition  function  factors  into  products  of  Ising  partition  functions, 
this  is  not  so  for  the  tunneling  quadrupoles.  The  microscopic  expressions 
for  the  dielectric  constant,  specific  heat  and  thermal  conductivity  are 
different  for  the  two  cases.  An  example  of  this  difference  is  exhibited  by 
the  calculation  of  the  CN'  near-neighbor  interaction  Kq  further  on  in  this 
letter.  Using  the  theory  for  the  tunneling  quadrupoles  gives  Kq  «  100  K, 
whereas  using  the  dipole  calculat1ons[22]  with  a  factorization  of  the 
partition  function  gives  [27]  Kq  450  K.  The  latter  is  much  greater  than 
what  is  believed  to  be  the  experimental  value. [27] 

The  results  of  these  derivations  were  submitted  and  accepted  for 
publication  in  the  Physical  Review  Letters,  published  on  December  10, 
1990. [32]  A  copy  of  the  manuscript  of  this  publication  is  also  enclosed  as 
Appendix  D  to  this  report. 
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Two  other  articles  on  the  detailed  derivation  of  these  results  have 
been  submitted  to  the  Physical  Review  under  the  titles:  "Glasslike 
excitations  from  dilute  interacting  tunneling  units  1:  Four  orientational 
tunneling  quadrupoles. [33]  Finally  and  "thermal  conductivity  of  dilute 
tunneling  quadrupole:  Are  the  two  types  of  tunneling  unit. [34]  Copies  of 
both  articles  is  enclosed  in  the  Appendix  E  and  Appendix  F  to  this  report. 

The  olassv  state  at  higher  concentrations.  The  work  outlined  above 
obtains  a  solution  of  the  dilute  quadrupole  problem  for  very  low 
concentrations  of  tunneling  quadrupoles  distribute  in  a  nonpolar  medium. 
The  derivations  give  low  energy  excitations  with  a  constant  density  of 
states.  The  derivations  arise  from  a  virial  expansion  of  the  free  energy 
in  the  impurity  concentration  c  and  therefore  is  valid  only  for  very  low 
c.  The  low  energy  excitations  arise  from  the  strongly  interacting 
tunneling  quadrupoles. 

What  happens  at  higher  concentrations  when  the  virial  expansion  is  no 
longer  valid?  In  such  a  case  one  has  to  resort  to  mean  field  or  other 
approximations  to  examine  the  glassy  state.  In  order  to  do  so  we  studied 
the  low  energy  excitation  spectrum  of  dilute  concentrations  of  interacting 
tunneling  quadrupoles  randomly  distributed  in  a  non-polar  medium  the  mean- 
field  approximation.  In  particular  we  consider  the  case  of  six 
orientational  tunneling  quadrupoles  with  a  r'^  (elastic)  Interaction.  To 
examine  the  nature  of  the  low  energy  excitations  for  higher  concentrations 
we  derive  the  free  energy  and. the  specific  heat  for  six  orientational 
tunneling  quadrupoles  in  the  mean  field  approximation.  The  internal  field 
is  a  random  variable  and  for  relatively  low  concentrations  has  a  Lorenzian 
distribution.  We  find  that  the  low  energy  density  of  states  is  a  constant 
and  that  the  low  energy  excitations  arise  from  the  large  internal  fields, 


4-10 


i.  e.  strongly  interacting  tunneling  quadrupoles  just  like  for  the  case  of 
the  very  low  concentration  limit  discussed  above.  This  work,  entitled  "Low 
energy  excitations  from  interacting  tunneling  units  in  the  mean-field 
approximation"  was  presented  at  the  International  Conference  of  Relaxation 
of  Complex  Systems,  Crete,  Greece,  June  17-29  1990;  it  was  also  submitted 
to  the  Journal  of  Noncrystalline  Solids  and  will  be  published  in  April 
1991. [35]  A  copy  of  this  manuscript  is  included  as  Appendix  G  to  this 
report. 

We  summarize  the  work  accomplished  under  this  contract.  A  good  deal 
of  progress  was  made  in  the  understanding  of  the  glasslike  properties  of 
the  tunneling  units  at  low  temperatures  and  of  the  nature  of  the  glassy 
state.  Similar  progress  was  made  in  the  understanding  of  the  specific 
heat,  thermal  expansion,  sound  absorption  and  propagation  and  dielectric 
properties  of  the  tunneling  quadrupoles.  We  have  derived  the  first 
microscopic  theory  for  the  glasslike  state  without  using  the  mean  field 
approximation.  Our  approach  is  completely  different  from  the  currently 
accepted  theories  on  glasses  and  is  quite  promising  that  it  will  lead  to  a 
better  understanding  of  the  microscopies  of  amorphous  systems;  this  has 
eluded  physicists  for  the  past  eighteen  years  in  spite  of  considerable 
effort  theoretical  effort. 

The  approaches  mentioned  above  for  the  tunneling  dipole  and  tunneling 
quadrupole  problem  can  be  quite  likely  also  applied  to  the  glasslike 
properties  of  high  T^  superconductors.  This  study  will  be  undertaken  in 
the  near  future. 
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6.  RECOMMENDED  FURTHER  WORK 

The  work  accomplished  in  this  proposal  has  been  highly  successful  in 
furthering  our  understanding  the  current  state  of  the  low  energy  excita¬ 
tions  in  randomly  distributed  tunneling  units.  The  work  accomplished  so 
far  seems  to  point  the  way  to  further  research  in  this  area.  It  has 
already  helped  clear  up  some  of  the  reasons  for  the  low  temperature 
anomalies  in  amorphous  materials  and  glasses.  A  good  approach  to  attack 
the  general  problem  seems  to  be  a  combination  of  low  concentration 
solutions  with  mean-field  approaches  at  higher  concentrations.  The  under¬ 
standing  of  this  problem  could  also  help  improve  the  possible  prediction 
of  new  amorphous  materials  with  various  desirable  properties. 

Next  we  would  like  to  address  ourselves  to  the  glassy  properties  of 
high  Tg  superconductors.  These  glassy  properties  have  an  important  effect 
on  the  pinning  of  the  flux  lines,  on  the  critical  current  densities  and  on 
other  physical  properties  in  high  T^  superconductors.  It  is  therefore  very 
important  to  understand  their  glassy  nature.  It  is  suggested  that  further 
work  should  be  done  using  some  of  the  methods  derived  in  the  enclosed 
papers  as  a  guide  to  understand  the  glassy  properties  of  high 
superconductors. 
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ABSTRACT 

REDUCED  BANDWIDTH  BINARY  PHASE-ONLY  FILTERS 

A  binary  phase-only  optical  correlator,  using  an  inexpensive 
liquid-crystal  television  as  the  spatial  light  modulator  for  the 
filter  function,  was  developed  and  tested.  A  niunber  of 
simulations,  and  a  small  number  of  experimental  runs,  investigated 
the  effects  of  reducing  the  bandwidth  of  the  filter.  The  low-pass 
effect  allows  a  trade  of  specificity  for  tolerance,  and  binary 
phase-only  correlation  intensity  peaks  can  be  made  to  look  much 
those  obtained  for  a  matched  filter.  A  high-pass  prefiltering 
operation  can  increase  specificity,  at  a  cost  of  reduced  tolerance 
to  scale  changes  and  rotations.  The  quality  of  the  experimental 
correlator  performance  obtained  was  not  good,  but  was  adequate  for 
demonstration  purposes,  and  could  be  improved  with  further 
development . 
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REDUCED  BANDWIDTH  BINARY  PHASE-ONLY  FILTERS 


INTRODUCTION 

The  purposes  of  this  research  program  were 

1.  To  set  up  an  experimental  correlator  system  to  verify 
simulation  results  and  provide  guidance  into  effects  that 
need  to  be  included  in  simulations. 

2.  To  investigate,  both  theoretically  and  experimentally,  the 
advantages  and  disadvantages  of  operating  the  phase-only 
filter  at  reduced  spatial  bandwidths  (low-pass,  high-pass, 
and  bandpass). 

To  accomplish  these  purposes,  procedures  followed  included  running 
a  number  of  simulations  on  diverse  patterns,  designing  a  test  bed 
correlator  based  on  an  inexpensive  liquid  crystal  television  as  the 
spatial  light  modulator  (the  correlator  filter),  and  testing  the 
correlator  system,  with  special  attention  given  to  the  effect  of 
reducing  spatial  bandwidth. 

PRELIMINARY  DISCUSSION;  BASIC  CONCEPTS 


The  use  of  binary  phase-only  filters  in  optical  correlators  was 
suggested  and  popularized  by  Horner  and  co-workers  [1,2,3].  Work 
in  image  processing  had  previously  shown  that  most  of  the 
information  content  of  images  resides  in  phase  information,  with 
very  little  attributable  to  amplitude  information  [4].  Details  of 
the  theory  behind  the  use  of  binary  phase-only  filters  in 
correlators  were  subsequertly  mapped  out  by  many  researchers. 

A  very  simplified  overview  of  the  concepts  included  in  this  report 
can  be  obtained  by  considering  a  one-dimensional  pulse,  such  as 
that  shown  in  Figure  1.  The  pulse  depicted  is  of  unit  height  and 
is  9  pixels  wide  (in  a  field  of  128  pixels).  The  Fourier  transform 
of  this  pulse  is  the  familiar  sin(x)/x  waveform  shown  in  Figure  2. 
Note  that  much  of  the  area  under  the  curve  is  in  the  central  lobe, 
the  low-frequency  Fourier  components,  with  diminishing  amplitudes 
at  higher  spatial  frequencies.  Fourier  components  are  normally 
complex,  but  for  this  special  case  of  a  symmetric  waveform  centered 
about  the  coordinate  origin,  all  amplitudes  have  phase  angles  of  0 
or  X  radians.  The  application  of  a  binary  phase-only  filter  (BPOF) 
to  this  spectrum  is  precisely  equivalent  to  simply  taking  the 
magnitude.  The  magnitude  of  the  transform  of  the  pulse  waveform  is 
shown  in  Figure  3.  Retransforming  the  filtered  spectrum,  that  is, 
taking  the  Fourier  transform  of  the  magnitude  of  the  Fourier 
transform  of  the  rectangular  pulse,  results  in  the  correlation 
waveform  shown  in  Figure  4.  The  correlation  shows  a  narrow  high 
peak  at  the  location  of  the  original  pulse,  and  some  much  weaker 
sidelobes  are  in  evidence.  This  is  behavior  that  is  typical  of 
BPOF  correlators. 
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The  high  peak  results  from  the  addition  of  in-phase  components,  so 
if  the  filter  blocked  further-out  lobes  from  contributing  to  the 
transform  summation,  the  observed  peak  would  become  lower  and 
broader.  On  the  other  hand,  it  would  then  be  less  sensitive  to  any 
differences  in  scale  or  angle  rotations.  Reducing  the  bandwidth  of 
a  phase-only  filter  allows  a  correlator  to  be  designed  with  any 
desired  tradeoff  between  specificity  and  tolerance. 

SIMULATIONS 

The  liquid  crystal  tv  used  as  a  spatial  light  modulator  filter  does 
not  have  sufficient  resolution  to  use  images  of  any  complexity  as 
test  patterns.  Consequently,  all  test  patterns  used  were 
relatively  simple  geometric  shapes,  and  a  binary  amplitude  pattern 
was  used  (no  gray  scale).  Three  patterns  were  used  in  the 
investigations:  A  simple  square  (Figure  5),  a  stylized  version  of 
an  airplane  (Figure  6),  and  text  spelling  out  "OKLAHOMA  SOONERS" 
(Figure  7 ) . 

The  Fourier  transform  magnitude  of  the  39x39  pixel  square  of  Figure 
5  is  depicted  in  Figure  8.  Correlations  of  this  square  with 
filtered  versions  of  itself  are  given  in  Figures  9  through  14.  All 
correlations  are  plots  of  intensity,  not  amplitude.  A  matched 
filter  (MF)  is  used  in  the  correlation  shown  in  Figure  9,  while  a 
binary  phase-only  filter  (BPOF)  is  used  in  the  correlation  depicted 
in  Figure  10.  Figures  11  and  12  show  the  effect  of  reducing  the 
bandwidth  of  the  BPOF.  Figure  ll's  filter  passed  only  the  main 
lobe  of  the  Fourier  transform,  while  the  filter  used  in  Figure  12 
passed  the  main  lobe  plus  the  next  ones  (zero  order  and  first  order 
terms).  Note  the  similarity  in  appearance  and  peak  value  to  the 
matched  filter,  suggesting  that  reduced-bandwidth  BPOFs  can  be  used 
as  approximations  to  matched  filters,  at  least  for  images  that  are 
not  too  complex.  Figure  13  shows  the  correlation  obtained  for  a 
high-pass  filter.  All  but  the  central  lobe  is  used  for  this 
filter.  As  expected,  a  correlation  involving  a  high-pass  filtering 
operation  results  in  a  very  sharp  peak,  with  the  concomitant 
increase  in  sensitivity  to  rotations  or  scale  changes.  Figure  14 
shows  a  bandpass  filter  correlation.  Only  the  2nd,  3rd,  and  4th 
lobes  are  used  in  constructing  this  filter.  Note  that  it  has  a 
much  reduced  correlation  peak,  suggesting  that  its  use  would  be 
restricted  to  distinguishing  nearly  identical  images. 

The  Fourier  transform  magnitude  for  the  plane  of  Figure  6  is  shown 
in  Figure  15.  Figure  16  shows  correlations  for  the  plane  image 
using  a  BPOF,  a  CPOF  (continuous  phase-only  filter),  and  an  MF. 
The  peak  to  sideband  ratio  (PSR)  is  the  ratio  of  the  correlation 
peak  value  to  the  sum  of  pixel  values  that  are  less  than  half  of 
the  peak  value.  Figure  17  is  the  same  three  correlations,  but  with 
added  noise  this  time.  The  added  noise  was  of  sufficient  amplitude 
to  change,  on  the  average,  every  4th  pixel  value.  A  signal  to 
noise  ratio  (SNR)  was  calculated  as  the  ratio  of  the  correlation 
peak  intensity  to  the  average  noise  intensity.  As  expected,  the 
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matched  filter  perforins  best  for  this  measure,  since  it  is 
optimized  to  do  just  that.  The  BPOP  performance  is  considerably 
degraded,  but  it  does  still  clearly  show  a  correlation  peak. 
Figure  18  shows  correlations,  with  and  without  added  noise  (of  the 
same  0.25  watt  per  pixel  level  as  before),  for  the  plane  image 
using  a  reduced  bandwidth  BPOF.  Only  the  central  4  lobes  of  the 
Fourier  transform  are  used  for  this  simulation.  A  comparison  of 
Figure  18(b)  with  Figure  17(a)  shows  that  reducing  the  filter 
bandwidth  improves  the  SNR  by  a  factor  6f  2.15  in  this  case. 


The  last  set  of  simulations  was  based  on  an  image  consisting  of  the 
letters  OKLAHOMA  SOONERS,  previously  shown  in  Figure  7.  The 
interesting  aspect  of  this  particular  simulation  is  that 
correlation  is  used  to  distinguish  symbols  whose  properties  are 
quite  similar.  The  Fourier  transform  magnitude  for  OKLAHOMA 
SOONERS  is  shown  in  Figure  19.  Figure  20  depicts  Fourier  transform 
magnitudes  for  the  letters  O  and  S  by  themselves,  and  Figures  21 
and  22  show  the  central  columns  of  the  3-dimensional  graphs  of 
Figure  20.  The  transforms  of  these  letters  are  seen  to  be  quite 
similar  in  appearance. 

Figure  23  shows  the  correlation  intensities  obtained  for  OKLAHOMA 
SOONERS  using  a  BPOF  based  on  the  letter  0.  The  4  occurrences  of 
the  letter  0  are  clearly  indicated  by  strong  correlation  peaks 
(remember  that  the  image  is  inverted  in  the  correlation  plane). 
The  values  of  autocorrelation  intensities  vary  from  27.0  to  28.4, 
while  the  highest  cross-correlation  intensity  is  5.64  (for  the 
letter  M) .  Correlations  of  the  letter  S  with  OKLAHOMA  SOONERS  are 
shown  in  Figure  24.  Autocorrelation  peak  intensities  are  26.6  and 
28.9,  while  the  highest  cross-correlation  peak  is  7.92  (for  the 
letter  E) ,  followed  by  3.92  (for  the  letter  0). 

For  the  next  simulation,  the  BPOF  for  the  letter  O  was  modified  by 
blocking  the  central  lobe,  to  achieve  a  high-pass  effect.  The 
resulting  correlations  of  0  with  OKLAHOMA  SOONERS  is  shown  in 
Figure  25.  The  autocorrelation  peak  intensities  range  from  15.7  to 
16.6,  and  the  highest  cross-correlation  intensity  is  2.32  (again 
the  letter  M,  as  for  the  ful 1 -bandwidth  case).  The  high-pass 
filtering  resulted  in  improving  the  discrimination  by  a  factor  of 
about  1.4  in  this  example.  Since  all  the  letters  have  about  the 
same  area,  it  is  not  expected  that  low  frequency  components  are  of 
much  use  in  distinguishing  one  letter  from  another.  Doing  the  same 
high-pass  operation  on  the  BPOF  for  the  letter  S  produced  the 
correlations  shown  in  Figure  26.  The  two  autocorrelation 
intensities  were  13.35  and  14.90,  while  the  highest  cross¬ 
correlation  intensity  was  4.44  (for  E,  as  before).  The  use  of 
high-pass  filtering  results  in  a  slight  degradation  of  the 
discrimination  ability  in  this  instance.  The  average 
autocorrelation  intensity  to  largest  cross-correlation  intensity 
reduces,  by  a  factor  of  about  0.91. 


The  next  operation  investigated  was  low-pass  filtering.  In  Figures 
27  and  28,  only  the  main  lobe  of  the  BPOFs  were  used.  Figure  27 
shows  correlation  intensities  for  the  letter  0  with  OKLAHOMA 
SOONERS,  while  Figure  28  is  based  on  a  BPOF  central  lobe  for  the 
letter  S.  The  four  0  autocorrelation  intensities  range  from  6.50 
to  7.23,  while  the  highest  cross-correlation  intensity  is  3. OS  (for 
the  letter  A),  with  others  close  behind.  The  two  S  autocorrelation 
intensities  are  7.15  and  7.98,  while  the  highest  cross-correlation 
intensities  are  two  occurrences  of  3.37  (for  letters  E  and  O) . 

SLM  HARDWARE  AND  SOFTWARE 


The  spatial  light  modulator  (SLM)  used  as  a  filter  was  a  liquid 
crystal  television  (LCTV),  a  Realistic  model  16-156.  This  device's 
liquid  crystal  display  consists  of  a  matrix  of  158  by  144  pixels  on 
a  7.1  by  5.3  cm  panel.  The  pixel  spacing  was  approximately  445  by 
365  vun.  Two  LCTVs  of  the  same  brand  and  model  were  purchased,  and 
they  were  found  to  differ  somewhat  in  detail.  The  data  presented 
is  for  the  one  selected  for  use  as  a  filter  in  the  experimental 
work.  Pixel  spacing  was  approximately  445  by  365  um. 
Approximately  80  percent  of  each  pixel's  area  was  active  area,  with 
the  remaining  20  percent  clear  interpixel  area  in  which  optical 
beam  phase  rotations  were  not  affected  by  voltages  applied  to  the 
pixels . 

As  discussed  in  Tai  [5],  several  modifications  must  be  made  to  the 
LCTV  to  convert  it  into  a  useful  SLM.  A  stop  must  be  broken  to 
enable  the  liquid  crystal  display  to  be  positioned  at  a  90  degree 
angle  to  the  main  tv  unit.  A  diffuser  and  two  polarizers  (of  very 
low  quality)  must  be  removed  from  the  display  unit.  What  remains 
is  a  nematic  liquid  crystal  sheet  sandwiched  between  thin  glass 
plates . 

In  order  to  use  the  LCTV  as  an  SLM,  it  is  also  necessary  to  be  able 
to  use  a  computer  to  program  the  individual  pixels.  The  LCTVs  used 
were  equipped  with  composite  video  input  jacks.  Available 
computers  were  DOS  Pcs.  Interfacing  circuitry  is  thus  required  to 
convert  the  video  output  of  the  computers  into  the  format  required 
by  the  LCTV  video  input.  A  circuit  board,  based  on  a  Motorola 
MC1377  Color  Television  RGB  to  NTSC  Encoder  integrated  circuit,  was 
purchased  and  found  to  not  work.  After  some  design  modification, 
a  workable  circuit  was  obtained  to  perform  interfacing  between  the 
computer's  digital  RGB  format  and  the  LCTVs  analog  NTSC  format. 
The  interface  circuitry  allowed  computer-generated  images  to  be 
displayed  on  the  LCTV  screen,  but  addressing  individual  pixels 
required  additional  software.  The  CGA  graphics  screen  consists  of 
200  by  640  pixels  which  address  133  by  149  (of  144  by  158)  pixels 
of  the  LCTV.  This  mapping  is  due  to  the  computer's  video  timing 
signal.  In  order  to  vertically  map  computer  pixels  to  LCTV  pixels, 
each  two  out  of  three  computer  rows  were  mapped  to  one  row  of  LCTV 
pixels.  There  is  no  correspondingly  good  horizontal  mapping,  thus 
some  horizontal  blurring  is  unavoidable.  The  horizontal  blurring 


distortion  was  minimized  by  a  trial-and-error  process.  The  end 
result  was  that  a  fairly  complex  pattern  of  using  particular 
computer  pixels  and  skipping  others  was  found  to  work  quite  well. 
Computer  programs  were  written  to  allow  desired  patterns  to  be 
written  on  the  LCTV. 

LCTV  CHARACTERIZATION  AND  PARAMETER  OPTIMIZATION 

Now  that  hardware  and  software  was  in  place  that  permitted  a 
desired  pattern  to  be  generated  on  a  computer  and  transferred  to 
the  LCTV,  the  LCTV  was  tested  extensively  to  determine  optimum 
settings  of  the  operational  variables.  These  variables  were  the 
input  polarization  angle,  output  polarizer  angle,  and  the  amount  of 
dc  bias  used  with  the  liquid  crystal  panel.  Several  authors  have 
given  theory  and  data  for  inexpensive  LCTVs  [6, 7, 8, 9] 

The  birefringent  axes  of  the  liquid  crystal  material  rotate  as  the 
LCTV's  dc  bias  voltage  (television  contrast  control)  is  varied.  To 
determine  the  orientation  of  birefringent  axes,  the  bias  voltage 
was  varied  from  13.4  to  19.4  volts  dc  in  0.2  volt  steps,  kt  each 
step  the  input  beam  polarization  is  varied  until  an  analyzer  at  the 
output  can  obtain  the  most  complete  null  possible.  This  is  done 
both  for  pixels  in  the  on  state  and  pixels  in  the  off  state.  The 
result  of  this  measurement  is  shown  in  Figure  29.  Measurements  of 
a  number  of  LCTV  parameters  versus  contrast  voltage  are  shown  in 
Figure  30.  Based  on  this  data,  the  maximum  difference  in  ON  state 
and  OFF  state  polarizations  is  11*,  occurring  at  a  bias  voltage  of 
18  V.  The  various  angles  and  polarizations  appropriate  to  this 
choice  are  shown  in  Figure  31.  It  is  also  important,  however,  to 
block  the  interpixel  light,  as  this  adds  only  noise  at  the 
correlation  plane.  Unfortunately,  it  is  not  possible  to  completely 
block  interpixel  light  and  still  use  the  LCTV  as  a  phase  modulator. 
The  best  bias  voltage  for  blocking  interpixel  light  is  14.6  V.  A 
vector  diagram  for  this  choice  is  shown  in  Figure  32.  It  is  seen 
that  an  ON-OFF  phase  difference  of  only  2*  is  obtained  at  this  bias 
point.  Since  it  is  not  possible  to  simultaneously  maximize  the  ON- 
OFF  phase  difference  and  eliminate  interpixel  light,  a  compromise 
bias  voltage  of  16.4  V  was  used  for  the  experimental  work.  As 
shown  in  Figure  33,  this  choice  produces  an  ON-OFF  phase  difference 
of  7*,  but  unfortunately  passes  82  percent  of  the  interpixel  light. 
The  net  result  is  that  correlation  plane  images  were  superpositions 
of  the  original  image  and  correlation  intensities. 

One  concern  usually  voiced  for  using  inexpensive  LCTVs  as  SLMs  is 
that  the  phase  is  not  uniform  over  the  extent  of  the  display.  This 
difficulty  has  been  overcome  by  mounting  the  LCTV  in  a  liquid  gate 
[10],  sandwiching  the  device  between  optical  plates  [11], 
correcting  the  phase  holographically  [12],  and  encoding  a 
correction  into  a  BPOF  [13].  Horner  [14]  suggests  that  phase 
correction  might  not  be  necessary  if  phase  distortion  is  on  the 
order  of  1.6x  or  less.  Interferometric  measurements  on  the  LCTV 
used  in  this  experimental  system  indicated  that  its  phase 

5-7 


'istortion  was  not  much  worse  than  this,  so  no  phase  correction  was 
used  in  the  experimental  work  described  in  this  report. 

CORRELATOR  TEST  BED  DESIGN 


The  basic  configuration  to  be  used  is  the  standard  4f  or  Vanderlugt 
correlator.  The  minimum  focal  length  that  can  be  used  is  dictated 
by  the  pixel  sizes  and  overall  extents  of  the  input  and  filter 
patterns.  Using  the  rather  large  pixel  spacing  of  the  liquid 
crystal  tv  (LCTV),  365  x  445  pm,  and  an  overall  extent  of  120  x  146 
pixels  for  both  input  and  filter,  a  minimum  focal  length  to  avoid 
aliasing  is  calculated  to  be  approximately  20  m.  Design 
constraints  were  that  the  system  should  use  only  available  2  inch 
diameter  lenses,  all  tolerances  should  be  correctable  by 
positioning  adjustments,  and  the  overall  length  should  be  less  than 
8  m  (to  fit  on  an  laboratory  countertop).  The  desired  20  m  focal 
length  was  achieved  in  a  shorter  distance  by  using  lens  pairs  in  a 
telephoto  configuration,  and  the  overall  length  was  further  reduced 
by  using  a  phase  correction  lens  at  the  filter  plane.  The  final 
correlator  design  is  shown  in  Figure  34.  All  lenses  are  mounted  on 
rails,  permitting  all  fine-tuning  adjustments  to  be  made  by 
adjusting  lens  positions.  Because  of  the  LCTV's  large  pixel  size, 
the  overall  length  is  somewhat  longer  than  usually  used  in 
correlator  test  beds,  but  the  design  proved  to  be  quite  workable. 

EXPERIMENTAL  CORRELATOR  RESULTS 

Although  input  transparencies  were  prepared  for  additional  images, 
the  only  image  used  in  experimental  work  was  that  of  a  simple 
rectangle,  a  slit  6.935  mm  tall  and  1.422  mm  wide.  The  rectangular 
aperture  was  constructed  using  four  razor  blades.  Because  of 
delays  in  getting  the  system  operational,  no  attempt  was  made  to 
take  data  for  more  complex  inputs  (some  parts  ordered  for  use  in 
the  correlator  system  did  not  arrive  until  after  all  work  had  been 
completed).  In  order  to  ensure  that  the  image's  Fourier  transform 
scale  matched  that  of  the  filter  written  on  the  LCTV,  the 
calculated  Fourier  transform  was  drafted  on  paper  (eminently 
doable,  thanks  to  the  relatively  large  scale  involved),  and  the 
size  of  the  rectangle  was  trimmed  to  make  the  transform-plane 
pattern  agree  with  the  calculated  pattern.  All  final  adjustments, 
such  as  small  adjustments  in  the  x-y  position  of  the  LCTV,  were 
made  while  observing  the  correlation  plane  output,  captured  by  a 
frame  grabber  from  the  CCD  camera  and  displayed  on  a  computer 
monitor. 

The  image  observed  at  the  correlation  plane  had  a  noticeable 
interference  bar  pattern,  with  a  slight  vibration  effect  also 
noticeable.  The  very  small  pixel  size  of  the  CCD  camera  thus 
resulted  in  a  correlation  peak  intensity  that  varied  with  tine. 
The  relative  time  during  a  LCTV  scan  cycle  at  which  an  image  was 
captured  may  also  have  been  a  consideration.  Figures  35,  36,  and 
37  show  three  examples  of  detected  correlation  plane  intensities. 
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Some  averaging  of  adjacent  pixels  had  to  be  done  to  acconvnodate 
restrictions  imposed  by  the  3'dimensional  mesh  plotting  program, 
but  the  overall  appearance  is  little  changed.  Figure  35  shows  a 
correlation  peak  of  220,  with  a  background  average  noise  level  of 
45.35.  The  SNR  is  thus  4.85.  The  average  of  intensities  below  50 
percent  of  the  peak  is  51.08,  thus  the  PSR  is  4.3  for  the  image  of 
Figure  35.  For  Figure  36,  the  correlation  peak  intensity  was  255, 
the  noise  level  was  47.45,  and  the  average  of  intensities  below  50 
percent  of  the  peak  was  51.11.  The  resulting  ratios  are  then  PSR 
=  4.99,  SNR  =  5.37.  For  Figure  37,  the  correlation  peak  intensity 
was  234,  the  noise  level  was  47.54,  and  the  average  of  intensities 
below  50  percent  of  the  peak  was  51.03.  Ratios  are  PSR  =  4.59,  SNR 
=  4.92.  The  appearances  of  the  correlation  plane  images  differ 
considerably  from  the  ideal  of  sharp  narrow  correlation  peaks,  but 
at  least  a  correlation  peak  is  clearly  evident.  The  image  of  the 
input  rectangle,  transmitted  as  interpixel  light,  is  also  evident. 
Probable  factors  leading  to  a  reduced  correlation  peak  include 
decorrelation  at  higher  spatial  frequencies  due  to  a  slight  scale 
mismatch,  defocussing  due  to  nonuniform  phase  distribution  across 
the  LCTV  aperture,  scattering  due  to  use  of  a  low-quality  polaroid- 
type  polarizer,  and  perhaps  some  system  misalignment  due  to 
vibration,  as  the  8  m  long  system  was  not  installed  on  an  anti¬ 
vibration  table. 

High-pass  and  low-pass  operations  were  performed  by  physically 
blocking  parts  of  the  filter  aperture.  Figure  38  is  an  example  of 
the  effect  of  high-pass  filtering.  The  zero  order  main  lobe  of  the 
filter  response  was  blocked  for  this  measurement.  The  result  was 
a  correlation  peak  of  86,  separating  two  other  peaks.  Noise 
background  intensity  was  47.8,  the  SNR  is  1.8.  There  are 
essentially  no  intensities  below  50  percent  of  the  peak,  so  no  PSR 
can  be  calculated.  Figure  39  shows  the  results  of  a  low-pass 
experiment.  A  variable  circular  aperture  was  placed  after  the  LCTV 
to  achieve  a  low-pass  filtering  effect.  This  particular  experiment 
had  a  correlation  intensity  peak  of  165,  with  a  background  noise 
level  of  48.11.  The  average  of  intensities  below  50  percent  of  the 
peak  was  50.09.  The  resulting  PSR  is  3.29,  and  the  SNR  is  3.43. 

Lastly,  Figure  40  shows  a  cross-correlation  with  another 
unspecified  rectangle.  This  graph  shows  two  peaks  of  not  too 
different  height.  The  taller  has  an  intensity  of  144.  The 
background  noise  intensity  is  46.82.  One  interesting  feature  that 
shows  up  well  in  this  graph  is  the  slit  image,  caused  by  interpixel 
light  transmission. 

CONCLUSIONS  AND  RECOMMENDATIONS 

The  simulations  that  have  been  done  suggest  that  low-pass  filtering 
could  play  a  useful  role  in  applications  such  as  target 
recognition.  The  low-pass  correlation  produces  an  indication  that 
a  blob  of  about  the  right  size  is  present,  and  could  be  followed  by 
a  more  detailed  examination  if  suitable  candidates  are  indicated. 
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High-pass  filtering  produces  a  sharper  correlation  peak,  but  at  the 
expense  of  heightened  sensitivity  to  scale  changes  and  rotations. 
A  high-pass  filtered  correlation  is  likely  to  find  use  only  if  a 
very  specific  size  and  orientation  are  being  sought.  Bandpass 
filtering  is  unlikely  to  be  of  much  use  unless  the  major 
differences  between  similar  image  objects  lies  in  a  narrow  range  of 
spatial  frequencies. 

The  large  pixel  size,  lack  of  resolution,  and  small  dynamic  range 
all  conspire  to  restrict  the  LCTV  to  demonstration-type  uses.  The 
fact  that  interpixel  light  could  not  be  eliminated  in  the  unit 
employed  was  a  major  disappointment.  The  fact  that  LCTV  pixels  are 
rectangular,  not  square,  is  an  annoyance.  Still,  with  further 
development,  it  is  clear  that  the  LCTV  could  be  a  useful  component 
in  a  correlator  test  bed.  One  further  development  that  would  be 
interesting  to  pursue  would  be  to  develop  a  program  to  cause  the 
LCTV-written  filter  to  optimize  itself. 
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Figure  1.  A  1 -dimensional  rectangular  pulse  waveform  used  to 
demonstrate  the  basic  idea  of  a  binary  phase-only  filter. 
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Figure  2.  The  Fourier  transform  of  the  pulse  waveform  of  Figure  1. 
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Figure  3.  The  magnitude  of  the  Fourier  transform  of  the  pulse 
waveform  of  Figure  1  (result  of  a  binary  phase-only  operation). 
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Figure  4.  Fourier  transform  of  the  magnitude  of  the  transform  of 
the  pulse  of  Figure  1,  equivalent  to  a  correlation  using  a  binary 
phase-only  filter. 
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Figure  7.  Contour  graph  of  the  text  pattern  used  in  simulations. 
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10,  Correlation  of  the  rectangular  pattern  of  Figure  5  with 
using  a  binary  phase-only  filter.  The  correlation  peak 
ty  is  38.09. 
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Figure  11.  Correlation  of  the  rectangular  pattern  of  Figure  5  with 
itself  using  a  low-pass  processed  BPOF.  Only  the  main  lobe  of  the 
Fourier  response  is  kept.  The  correlation  intensity  peak  is  1.95. 


Figure  12.  Correlation  of  the  rectangular  pattern  of  Figure  5  with 
itself  using  a  low-pass  filtering  operation  with  a  BPOF.  Only  the 
main  lobe  and  the  next  lobes  out  are  retained.  The  correlation 
intensity  peak  value  is  2.71. 
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Figure  13.  Correlation  of  the  rectangular  pattern  of  Figure  5  with 
itself  using  a  high-pass  filtering  operation  with  a  BPOF.  Only  the 
central  lobe  of  the  Fourier  response  is  suppressed.  The 
correlation  intensity  peak  is  22.8. 


Figure  14.  Correlation  of  the  rectangular  pattern  of  Figure  5  with 
itself  using  a  bandpass  filtering  operation  with  a  BPOF.  Only  the 
2nd,  3rd,  and  4th  lobes  of  the  Fourier  transform  response  are  used. 
The  correlation  intensity  peak  is  only  0.26. 
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Figure  15.  Magnitude  of  Fourier  transform  for  airplane  image  of 
Figure  6. 
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Figure  17.  Same  as  Figure  16,  but  with  added  noise 
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Figure  19,  Fourier  transform  magnitude  for  OKLAHOMA  SOONERS  text 
pattern , 
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Figure  20.  Fourier  transform  magnitudes  for  OKLAHOMA  SOONERS 
letter  "S"  (top)  and  "0"  (bottom). 
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Figure  23.  Correlations  of  the  letter  "0* 
Remember  that  the  text  is  inverted  in  the 
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Figure  24.  Correlations  of  the  letter  "S”  with  OKLAHOMA  SOONERS 
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Figure  25.  Effects  of  a  high-pass  filtering  operation  on  the 
correlation  of  the  letter  ’’O"  with  OKLAHOMA  SOONERS. 
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Figure  26.  Effects  of  a  high-pass  filtering  operation  on  the 
correlation  of  the  letter  "S"  with  OKLAHOMA  SOONERS. 
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Figure  28.  Effects  of  a  low-pass  filtering  operation  on  the 
correlation  of  the  letter  "S"  with  OKLAHOMA  SOONERS. 
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Figure  29. 
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Figure  30.  LCTV  polarization  angles  as  a  function  of  contrast 
vol tage . 
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IP  -  Interpixel 
A  -  Analyzer 
IN  -  Input 


B  -  Bisector 
OFF  -  Off  State  Output 
ON  *  On  State  Output 


Figure  31.  Vector  diagram  of  LCTV  polarization  angles  for  maximum 
difference  between  on  state  and  off  state  output  polarization 
angles  (contrast  voltage  used:  18  V). 
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Figure  32.  Vector  diagram  of  LCTV  polarization  angles  for  maximum 
blockage  of  interpixel  light  (contrast  voltage  used;  14.6  V). 
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Figure  33.  Vector  diagram  of  LCTV  polarization  angles  for  the 
compromise  setting  used  for  the  experimental  data  runs  (contrast 
voltage  used:  16.4  V). 
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Figure  34.  The  test  correlator  configuration. 
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Figure  35. 
Example  No. 


Experimental  autocorrelation 
1.  The  relative  correlation 


result  using  a  LCTV  BPOF 
peak  intensity  is  220. 
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Figure  36.  Experimental  autocorrelation  result  using  a  LCTV  BPOF 
Example  No.  2.  The  relative  correlation  peak  intensity  is  255. 


Figure  38. 
filtering . 


Same  as  Figures  35  -  37,  but  with  added  high-pass 
The  relative  correlation  peak  intensity  is  86. 
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Figure  39. 
filtering . 


Same  as  Figures  35  -  37,  but  with  added  low-pass 

The  relative  correlation  peak  intensity  is  165. 
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Figure  40.  Example  of  a  cross-correlation  of  a  rectangle  pattern 
with  a  different  rectangle.  The  higher  correlation  peak  has  a 
relative  intensity  of  144. 
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Chapter  I 


Computer  Modeling  of  GaAs/AlGaAs 
MQW  Devices  for  Optical  Properties 


Introduction 

The  objective  of  this  research  is  to  develop  a  heterostructure  device 
model  leading  to  an  optical  gate  for  optical  computing  systems.  One 
form  of  optical  element  is  similar  to  an  injected  laser  diode,  and  is 
thus  reasonably  compatible  with  the  size  of  optical  computer  processing 

t 

elements,  integrated  optic  waveguide  and  optical  fiber  interconnection. 
Mutiple  Quantum  Well  (MQW)  devices  have  appeared  as  logic  gates  to 
perform  entire  range  of  digital  logic  fimctions,  similar  to  conventional 
electonics  building  blocks.  [1] 

It  is  well  known  that  the  introduction  of  quantmn  wells  (with  a 

o 

thickness  I,  ~  lOOA)  in  the  active  layer  of  a  double  heterostructures 
GaAs/AlGaAs  drastically  improves  its  performence  [6-7].  In  these 
structures  the  carrier  motion  normal  to  the  active  layers  is  restricted. 
As  a  result,  the  kinetic  energy  of  the  carriers  moving  in  the  normal 
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direction  are  quantized  into  discrete  energy  levels  similar  to  the  well- 
known  quantum  mechanical  problem  of  the  one  dimensional  poten¬ 
tial  well,  and  hence  the  name  quantum  well  (QW)  lasers.  When  the 
thickness  of  the  active  region  becomes  comparable  to  the  de  Broglie 
wavelength  (A  ~  /i/p),  quantum  mechanical  effects  are  expected  to  oc¬ 
cur.  These  effects  are  observed  in  the  absorption  and  emission  char¬ 
acteristics  and  transport  characteristics  including  phenomena  such  as 
tunneling.  The  optical  characteristics  of  semiconducting  QW  double 
heterostructures  were  intially  studied  by  Dingle  et  al  [2).  Since  then 
extensive  work  on  GaAs/AlGaAs  QW  structures  has  been  done  by 
Holonyak  et  al[9,10l,  Tsang[ll,12)  and  Hersee  et  al  [13].In  recent  years, 
GaAs/AlGaAs  quantmn  well  lasers  have  been  of  great  interest  because 
of  the  special  properties  associated  with  the  two-dimentional-like  nar 
ture.  However,  due  to  the  great  number  of  parameters  involved,  the 
solution  to  the  problem  of  device  optimization  is  not  simple,  and  the 
discussion  about  the  choice  of  the  well  characterics  in  order  to  obtain 
the  lowest  possible  thereshold  current  is  still  open. 

In  the  following  chapters  anaiiytical  models  are  presented  to  char¬ 
acterize  the  MQW  ridge  structure.  These  include,  quantum  confined 


6-4 


carrier  energy  levels,  carrier  gain,  leakage  current,  nonradiative  recom¬ 
bination  current,  gain  comfinement  factor  and  finally  the  threshold  cur¬ 
rent  models.  In  chapter  III  calciilation  technique  and  optimal  device 
performance  model  for  Tninimnm  threshold  current  is  described.  Fol¬ 
lowing  this,  optical  gain  calculation  is  presented  in  this  chapter,  in¬ 
cluding  the  technique  to  optimize  the  structural  parameters  for  device 
performance  models.  Conclusions  are  presented  in  chapter  IV.  Finally 
computer  models  are  Usted  in  chapter  V. 
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Chapter  II 


Theoritical  Models  for  GaAs/AlGaAs 
MQWs'Application  to  Ridge  Structure 


II.l  Single  Quantum  Well  Energy  Levels 


The  quantum  size  effect  which  takes  place  when  one  of  the  sample 
dimensions  is  on  the  order  of  the  de  Broglie  wavelength  of  the  carriers 
is  now  well  known  [10,14] .  The  electrons  and  holes  experience  quanti¬ 
zation  in  the  confined  direction  and  their  energy  can  he  written  as[15]: 

h\ki + i?) 


£, '  m 


(1) 


t,h 


where  Ei  is  the  zth  confined  particle  energy,  m*  ^  is  the  electron  or  hole 
effective  mass,  h  is  Plank’s  constant,  and  and  are  the  x  and  y 
componenets  of  the  crystal  momentum  whose  quasi-contineous  values 
form  parabolic  subbands  minimum  energy  is  the  quantized  energy  level 
Ei.  The  origin  of  the  energy  is  taken  at  the  maximum  of  the  bulk 
valence  band  of  GaAs.  Assuming  a  symmetric  square  well  with  finite 
barrier  height,  the  energy  of  the  quantized  levels  can  be  found  for  MQW 
structure  by  solving  the  eigenvalue  equations: 


A  •  tan{  .4 


for  even  solutions; 


(2a) 
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A-cot{A~)  =  -B, 


for  odd  solutions. 


(25) 


where 


and 


{3a) 

(35) 


In  the  above  equations  m*  is  the  effective  mass  of  carriers  in  the  well, 
ml  is  the  effective  mass  of  the  carriers  in  the  barrier,  V  is  the  finite 
potential  barrier,  and  Z,  is  width  of  the  well. 


Figure  (1)  shows  schametically  the  energy  levels  of  the  electrons 
confined  in  a  quantum  well.  The  confine  particle  energy  levels  {Ei)  are 
denoted  by  Eic,E2c,E3c  for  electrons,  Eihh,E2hh,E3hh  for  heavy  holes  and 
Euh,E2ih,E3ih  for  light  holes.  Thus  in  a  QW  structure  the  energy  of  the 
emitted  photons  can  be  varied  simply  by  varying  the  well  width  I,.  In 
the  GaAs/GaAlAs  system,  the  mass  is  determined  as  a  function  of  the 
aluminum  composition  i  of  the  layer  as  [16]: 


mt  =  (0.067  -f  0.0835x)mo, 

(4a) 

m*  =  (0.48  +  0.31x)mo, 

(45) 
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where  m*  is  the  free  electron  mass  (the  light  holes  are  neglected).  The 
total  hetero  barrier  height  is  defined  as: 

AEgo  =  Eg{barrier)  —  Eg(weU) 

where 

Ego  =  1.424  +  1.247X,  if  X  <  0.45  (5a) 

Ego  =  1.424  4-  1.247X  +  1.147(x  -  0.45)^,  if  x  >  0.45  (56) 


Although  the  values  of  the  energy  discontinuities  of  the  conduction 
and  valence  bands  are  largly  discussed  [17],  this  model  uses  barrier 
height  Ve  =  0.58  •  AEgo  for  electron,  V),  =  0.32  •  AEgo  for  holes,  and 
V  =  Vt  +  V/k[8].  Equation  (2a)  and  (2b)  then  allow  the  determination  of 
quantized  levels  for  electrons  and  holes.  The  density  of  states,  constant 
within  one  subband,  is  given  by  [9]: 


8«.* 


(6) 


This  model  uses  the  classical  Fermi  statistics,  assiiming  that  elec¬ 
trons  and  holes  are  in  equilibrium  in  the  well  and  that  all  carriers  in 
the  same  band  are  characterized  by  the  same  quasi-Fermi  level.  They 
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are  determined  using  the  condition  of  electrical  neutrality: 

n  =  n«+P  (7) 

where  n,  is  the  p-type  impurity  concentration,  if  any  in  the  QW  layers. 
A  comparison  of  equation  (6)  with  a  regular  three-dimentional  density 
of  states  model  shows  that  the  density  of  states  in  the  QW  is  indepen¬ 
dent  of  carrier  energy  and  temperature.  The  modification  of  density  of 
states  in  a  QW  is  sketched  in  figure  (2). 


The  electron  density  n  is  obtained  by  integrating  over  kg  and  Jb,  in 
the  X  and  y  directions  and  by  summing  the  contribution  of  each  discrete 
subband.  Hence,  after  calculations,  n  can  be  expressed  as: 


(8) 


where  I,  is  the  well  width,  M  is  the  the  munber  of  quantized  levels  in 
the  well,  Ei  is  the  minimum  of  the  tth  subband,  and  E/c  is  the  Fermi 
level  for  electrons.  A  similar  expression  can  be  written  for  the  valence 
band  hole  density.  We  consider  only  the  heavy  holes  since  the  density 
of  states  of  the  light  hole  band  is  much  smaller  than  that  of  the  heavy 
hole  band  [18,19].  For  given  values  of  n  and  rxa,  (7),  (8),  and  the 
similar  equation  giving  p  allow  determination  of  the  Fermi  levels  Efe 
and  £/«  for  both  the  conduction  and  the  valence  bands. 
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II.2  Gain  in  GaAs/AlGaAs  MQWs 


To  examine  the  operating  characteristics  of  MQW  structures  theo¬ 
retically,  one  needs  to  study  the  gain  carrier  distribution  as  a  function 
of  photon  energy  and  its  dependence  on  the  injection  current.  The  gain 
distribution  under  on  (lasing)  condition  was  reported  to  be  slightly  in¬ 
homogeneous  (in  the  QWs)  for  an  undoped  active  region  and  to  be 
fairly  homogeneous  when  the  active  region  was  heavily  doped  [20,22]. 
The  so  called  ”band  tail”  was  also  well  explained  by  the  intra-band 
relaxation  process,  although  the  parabolic  (no  tail)  states  density  and 
wavenumber  conservation  rule  at  optical  transition  were  assumed  [23]. 
The  intra-band  relaxation  process  is  thus  one  of  the  most  important 
phenomena  characterising  the  gain  of  tne  laser  material.  Furthermore, 
gain  calculations  needs  a  numerical  integration  when  the  intra-band 
relaxation  time  is  taken  into  account,  as  was  shown  in  [20],  [22],  or  [23]. 

A.  Analytical  Model  for  MQW  Devices 

The  device  structure  shown  in  figure  (3)  is  used  for  modeling  the 
gain  of  MQW  structures.  The  optical  gain  in  QW  structures  with  dif- 
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ferent  well  widths  has  been  calculated  using  an  intraband  relaxation 
model  [18,23].  Detail  expression  for  this  model  is  shown  in  the  follow¬ 
ing.  However,  this  model  has  been  chosen  on  one  hand  because  the 
bell-like  spectral  shape  of  the  gain  curves  [18]  rules  out  the  k-selection 
rule  model,  and  on  the  other  hand  because  the  reported  experimen¬ 
tal  differential  gain  values  [19-24]  are  higher  than  those  calculated  by 
the  no-selection  rule  model.  In  addition,  the  gain  calculated  using  the 
chosen  model  agrees  well  with  experiment  reported  in  the  hterature[6]. 
According  to  the  intraband  relaxation  model  [18,23],  the  optical  gain, 
g{E),  is  calculated  by  the  following  expression: 


m-l  ,oo 


g(E)  =  const'^  j 


Ben’¥B<,n‘¥Bf0 


Pred  |M|' 


with 


(/c  fv)^lTin  ^  \dE 


(9) 

m 

(11) 


(12) 
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(13) 


=  £=»  +  —!  E„  -  -  Evn  -  E. 


m. 


go 


(14) 


|Mo|^  =  1.33m„f;j„ 


(15) 


Pred  = 


^}-'’(16) 


Finally  the  term 

35e2 

const  =  — T - r 

mjceo/i 


(17) 


where  Ecn,E„n  represent,  respectively,  the  quantizd  carrier  energies  in 
the  conduction  and  valence  bands,  AEi  =  Ec\  +  F„i  and  F,„  =  Ecn+E^n+ 
Ego  with  Ego  representing  the  band  gap,  is  the  reduced  mass  (1/mr  = 
1/mc  +  l/mu),  Ecv  represents  the  transition  energy,  E  corresponding  to 
energy  of  emitted  photon,  and  Tin  is  the  intraband  relaxation  time  of  the 
electron  and  hole  pair.  Other  terms  such  as  fe  and  fp  represents  Fenni- 
Dirac  distribution  functions  for  electrons  and  holes  respectively,  \M\^e 
and  represents  transverse  electric  and  transverse  magenetic  gain, 
\M\l  is  the  dipole  moment,  pru  is  the  radiative  recombination  rate,  and 
fin2illy  p  is  average  refractive  index  along  the  j -direction  of  the  QWs. 
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For  a  given  carrier  density  (n),  the  conduction  band  quasi>fenni 
level  (Efc)  is  calculated  by: 


Neglecting  the  residual  doping  in  the  layers  and  considering  the  charge 
neutrality  criterion: 

n=p.  (19) 

Furthermore,  valence  band  quasi-fermi  level  {Ef„)  is  obtained  by 


Optical  gain,  g{E),  is  plotted  as  a  function  of  photon  energy  for  a  given 
injection  current  in  figure  (5).  The  maximum  gain,  value  versus 

energy  (E)  at  a  given  injection  (7)  is  plotted  versus  the  injection  current 
(J)  in  figure  (6).  It  is  expected  that  as  the  injection  current  increases, 
the  staircase  density  of  states  in  QW  structure  induces  first  a  rapid 
gain  increase  and  then  a  saturation  profile  at  high  injection,  which  is 
interrupted  by  the  contribution  from  the  second  quantized  level.  The 
influence  of  this  specific  gain-current  behavior  on  the  threshold  current 
will  be  examined  later. 
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Injected  Current  [mA] 

Figure  (5) 


II.3  Leakage  Current 


A.  Current  in  QW  Ridge  Structure 


The  calculation  of  lateral  leakage  current  {I,  +  Idi/f)  is  performed 
using  the  same  formulism  as  in  the  model  developed  by  Joyce  [25].  The 
two  components  and  I^iff  of  the  lateral  leakage  current  are  illustrated 
in  figure  (6).  The  first  component  is  an  ohmic  sheet  current  /,(A/cm) 
(spreading  current),  which  flows  in  a  thickness  h  of  the  upper  p-layer 

s 

with  sheet  resistance  Q  -  p/h;  another  component  is  a  diffusion  current 
^diff{A/cm^),  which  flows  in  the  active  layer  of  thickness  d  -  M  ‘  L,. 
In  oxu*  calculation,  Joyce's  model  is  adopted  to  the  QW  structure  by 
introducing  the  gain  curves  of  section  II.2. 

B.  I,  and  Idifj  Neglecting  Nonradiative  Rcombinations 

As  shown  in  figiue  (6)  for  the  optical  cavity  region  beyond  the  strip 
contact  (x  >  ty),  there  are  two  components  to  the  lateral  carrier  flow. 
The  first  component  is  an  ohmic  sheet-current  density  /,  (A/cm),  which 
flows  in  a  thickness  h  of  the  p-layer  and  is  driven  by  the  p-layer  voltage 
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Width 


density,  and  J(A/cra  )  is  the  injected  current 


gradient  dVIdx,  i.e. 


dV(x) 


where  ft  (ohms-cm)  is  the  p-layer  resistivity  and  p  (ohms/square)  is 
the  sheet  resistcince.  The  sheet  current  density  /,  leaves  the  p-layer 
becoming  the  active-region  injected-ciirrent  density  JiAfcw}),  i.e., 


=  -7. 


It  is  assumed  that  the  evolution  of  the  electron  concentration  n(cm~^) 
in  the  active  layer  can  be  described  by  the  diffusion- type  equation: 

•'  (23) 


where  the  effective  diffusion  coefficient  D(cm^/3)  may  be  either  a  con¬ 
stant  or  the  concentration-dependent  variable  D{n).  To  preserve  charge 
neutrality  the  hole-diffusion-current  density  I^,ff{A/cm~'^)  and  the  electron- 
diffusion-current  density  -Idiff{Alcm^)  sum  to  zero  lateral  current  flow 
in  the  active  region.  The  diffusion  current  has  as  its  source  the  injection- 
current  density  J  and,  as  its  sink,  the  concentration-dependent  recom¬ 
bination  rate  R{n){cm~^  f s);  that  is, 

where  q  is  the  magnitude  of  the  electron  charge  and  d  is  the  active-layer 
thickness.  .A-s  a  first  order  approximation  any  effect  of  the  typically 
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smaller  n-layer  resistivity  is  neglected;  thus  the  x-dependent  p-layer 
voltage  V  is  also  the  voltage  across  the  active  layer.  The  above  equa¬ 
tion  must  be  solved  subject  to  the  self-consistent  condition  that,  at 
each  point  i,  the  voltage  V  be  a  unique  function,  V'(n),  of  the  local 
injected  concentration  n.  Explicit  fimctions  for  F(n)  and  R(n)  will  be 
introduced  at  a  later  stage,  but  controversy  still  surroimds  the  choice 
of  these  functions  which  are  determined  in  a  given  heterostructure  by 
the  active  region  doping,  by  the  amount  and  type  of  the  nonradiative 
recombination.  The  above  equations  hold  true  for  i  >  u>  (beyond  the 
strip)  but  by  alternatively  regarding  J  as  the  known  function,  one  can 
find  the  solution  under  the  strip.  The  additional  sink  term: 

in  the  above  model  represents  a  mode-stimulated  recombination  effect 
as  well.  Thus  this  model  is  applicable  up  to  thershold  or,  above  thresh¬ 
old,  to  the  regions  beyond  the  mode  saturation.  However,  neither  loss  of 
confinement  (carrier  escape  into  the  ternary  layers)  nor  inhomogenous 
temperature  effects  are  included  in  this  model.  Boundary  conditions 
at  X  =  oo  include  V  =  1^  =  J  =  Idiff  =  dn/dx  =  0,  and,  by  definition 
n  =  Tioo-  Several  authors  [23-28]  have  sought  a  description  of  lateral 
current  spreading  in  heterostructures  starting  from  equations  that  are 
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various  particular  cases  of  the  model  presented  here.  In  each  case  the 
carrier  flow  in  the  active  layer  was  assumed  to  result  only  from  diffusion, 
i.e.,  any  field  driven  carrier  motion  is  tacitly  neglected.  This  neglect 
does  not  appear  to  be  accurate  always,  although  is  often  is,  particularly 
for  an  n-type  active  layer.  The  carrier  transport  in  the  active  layer  can 
be  adequately  modeled  with  difliision  equations  as  explained  earlier. 
Although  this  model  is  similar  to  the  model  of  Joyce  [25],  the  following 
modifications  have  been  introduced  in  our  calculation. 

(1)  Stimulated  recombination  is  included  as: 

where  P  represents  optical  power  density;  g[n(x)]  is  the  gain 
expression  for  the  carrier  density  n  at  position  x  calculated  as 
before  for  QW  structures;  hw  is  the  photon  energy. 

(2)  The  specific  voltage  expression  (V(n))  in  QW  structures  is  used: 

V{n)  =  [Ej,  +  Ef,  +  E,o)lq  (25) 

with  EfcEfv  calculated  as  shown  in  section  II.2. 
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For  narrow  stripes,  (tV  =  A^m  in  our  calculations),  the  assumption 
of  a  constant  injected  current  density  J  =  Jo  under  the  strip  can  be 
adopted.  Then  one  has  to  solve  the  above  equations  for  each  region 
separately:  imder  the  strip  and  beyond  the  strip.  Nevertheless,  the 
analytical  solution  used  by  Joyce  cannot  be  applied  because  of  the 
introduction  of  the  stimulated  term  in  the  above  equation.  Therefore 
the  Runge-Kutta  numerical  technique  shall  be  adopted  to  solve  the 
coupled,  nonlonear,  nonhomogenous,  differential  equations. 

C.  Nonradiative  Recombination  Current 


The  above  models  does  not  account  for  nonradiative  recombination 
effects  for  the  calculation  of  the  thershold  current  for  lasing  action. 
However,  for  comparision  with  the  experimental  results,  nonradiative 
recombinations  must  be  included.  Thus,  equation  (24)  is  rewritten  as: 


dl 


diff 


dx 


(26) 


where  is  the  nonradiative  recombination  lifetime.  Once  again  the 
Runga-Kutta  numerical  technique  could  be  used  for  solving  the  diffu¬ 
sion  current  equation  mentioned  earlier.  Then,  /,  and  Idiff  ,  including 
the  nonradiative  recombination,  are  obtained  where  the  approximation 
is  made  and  the  confinement  factor  in  the  y  direction  is  approximated 
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by  Fy.  y[n(x)]  is  calculated  in  the  above  using  the  effective  index  method 
in  the  case  of  the  ridge  waveguide  structure  of  figure  (2),  we  calculated 
the  E{x)  expression.  The  effective  index  variation  [26]  induced  by  the 
carrier  injection  is  calculated  for  the  modal  gain  calculation.  It  must  be 
noted  that  the  absorption  in  the  region  below  transparency  is  included 
in  the  above  equation,  where  the  expressed  gain  is  the  net  modal  gain, 
(gmod)  in  the  optical  cavity. 

D.  Auger  Recombination  Current 

Auger  recombination  ciurent  (/««,er)  is  modeled  as  [27]: 

=  gMI,C.np2  (27) 

with  Ca  =  2  •  10"^®cmV3  for  the  CHSH  Auger  process.  For  other  Auger 
processes  see  appendix  A.  The  optical  cavity  recombination  current 
{Jcav)  is  presented  as  [28]: 

Jcav  —  3^eav(peav  Po^  (28) 

with  5  =  9- 10““ cm^/5,  Lg  represents  the  optical  cavity  width,  the 
doping  level  of  the  cavity,  and  neav,Pcav,  respectively  the  electron  and 
hole  desity  in  the  optical  cavity  evaluated  in  a  3  -  Z?  configuration  from 
the  relative  position  of  the  Fermi  level  in  the  QWs. 
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II.4  Threshold  Current 


In  this  section  a  general  expression  of  threshold  current  (I,*)  for 
MQW  structure  wil  be  established.  This  current  is  composed  of: 

(1)  a  radiative  recombination  current  Jr{A/cm^)  so  that  the  modal 
gain,  (gmod)  overcomes  the  cavity  losses, 

(2)  a  lateral  spreading  current  in  the  upper  confinement  layer  I,{A/cm)^ 

(3)  a  lateral  diffusion  current  in  the  active  layer  Idiff{A/cm^)y 

(4)  a  surface  or  interface  nonradiative  recombination  current,  Jnr{A/cm^), 

(5)  an  Auger  recombination  current  JauseriA/cm^),  and 

(6)  an  opticcd  cavity  recombination  current  Jeav(A/cm^)  due  to  ra¬ 
diative  recombination  carriers  leaking  in  the  optical  cavity. 
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Hence,  the  threshold  current  is  expected  as: 


Ith  —  {<^r(ldr5  —  9mod)  "J*  •^nr  <^auger  •^cav}^-^ 


+  2IgL  +  2Iiiff  •  MLfL 


(29a) 


with 


(296) 


Neglecting  the  nonradiative  recombination  current,  the  threshold  cur¬ 
rent  in  the  structure  is  approximated  as: 


^tk  =  (Jo  +  Jaugtr  +  Jcav)^L  +  21, L  (29c) 

where  the  lateral  diffusion  current  /*//  is  included  in  Jo{=  Jr  +  J)-  To 
calculate  the  threshold  current,  the  following  process  is  used.  For  a 
given  injection  carrier  density  at  the  strip  edges  (n*): 


(1)  Jo  and  I,  are  calculated  as  described  before. 


(2)  Jaugtr,Jcav  are  Calculated  as  given  in  steps  (3-4)  below. 


(3)  L*  can  be  calculated  by  rewriting  the  threshold  condition: 


r  =  ln{l/Rx  ■R2) 


-1 


(30) 
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(4)  The  threshold  current  is  obtained  as  per  equation  (29). 


The  following  parameter  values  are  used  in  the  calculation: 

1.  Optical  power  desity:  P  =  lO^W/cm^; 

2.  Sheet  resistivity:  p  =  0.50  Qcm,  and 

3.  Diffusion  cofRcient  in  the  active  layer:  D  =  lOcm^/j, 

4.  R\  -  0.3  and  =  0.8  (one  facet  is  uncoted  and  other 
is  with  treatment), 

finally 

5.  a,  represents  cavity  losses. 

II.5  Calculation  of  Modal  Gain  {gmoi) 

In  this  study  the  modal  gain  is  calculated  by: 


~  E‘{z)dx 

where  the  approximation  5[n(x,  y))  ~  y(n(x)]  •  y(n(y)J  is  made  and  the  con¬ 
finement  factor  in  the  y-direction  is  approximated  by  Ty.  y(n(x)]  is  cal¬ 
culated  as  in  the  earher  expression  for  g{E).  Using  the  effective  index 


method  in  the  case  of  the  ridge  waveguide  structure  E{x)  is  calculated. 
The  effective  index  variation  [16]  induced  by  the  carrier  injection  is  es¬ 
timated  to  be  less  th2Ln  0.01  percent  and  is  consequently  neglected  for 
the  modal  gain  calculation.  It  must  be  noted  that  the  absorption  in 
the  region  below  transperency  is  included  in  the  previous  expression, 
where  the  expressed  model  gain  is  the  net  modal  gain  in  the  optical 
cavity. 

II.6  Mode  Confinement  Factor  for  MQWs 

One  of  the  main  difference  between  the  SQW  and  MQW  is  that  the 
confinement  factor  (F)  of  the  optical  mode  is  significantly  smaller  for  the 
former  than  that  for  the  latter.  This  can  result  in  a  higher  threshold 
carrier  desity  and  in  higher  threshold  current  density  of  SQW  lasers 
when  compared  with  the  MQW  lasers.  The  confinement  factor  of  the 
fundamental  mode  for  a  SQW  is: 

r=DV(2  +  (32) 

with 

-4.  (33) 

is  found  to  be  accurate  to  within  1.5%  [29].  In  the  above  expression  d 
is  the  active  layer  thickness.  !•<,  =  2k/Xo  where  A*  is  the  wavelength  in 
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free  space  and  na  ,  nc  are  the  refractive  indicies  of  the  active  and  the 
cladding  layer,  respectively.  For  smaller  active  layer  thickness  D  «l 
and  r  is  expressed  as[30]: 

r  =  -  fil)  ■  (34) 


The  mode  confinement  in  MQW  structures  can  be  analyzed  by  solv¬ 
ing  the  electromagnetic  wave  equation  for  each  of  the  layers  with  the 
appropriate  boundary  conditions.  This  procedure  is  quite  tedious  and 
Streifer  et  al.[31]  have  shown  that  the  following  simple  formula  gives 
reasonably  accurate  results; 


r(MQW)  =  7 


Nad^ 

Nad.  +  Nidi' 


(35) 


where 


7  =  2ir^{il^ 


N.da  +  N^dk 


(36) 


and 


Ngdafig  +  Njdim 

Ngda  +  Nid^ 


(37) 


iVa,  iVj  are  the  number  of  active  and  barrier  layers  in  the  MQW  structure 
and  da,  dk  {fig  and  /it)  are  the  thickness  (and  refractive  indicies)  of  the 
active  and  barrier  layers,  respectively,  jl  is  the  average  refractive  index 
of  the  uniform  optical  mode  in  the  MQW  active  region.  Hence  the 
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7  is  the  confinement  factor  of  the  optical  mode,  r  is  thus  obtained 
by  multiplying  7  by  the  ratio  of  the  total  active  thickness  to  the  total 
thickness  of  the  active  and  barrier  layers. 
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Chapter  III 

Calculation  And  Optimization 


Figure  (3)  is  used  for  chracterization  of  MQW  stucture  under  inves¬ 
tigation.  At  this  stage  simple  calculations  are  performed  to  predict  the 
device  performance  using  the  models  of  chapter  II.  However,  extensive 
calculations  are  required  essentially  to  identify  the  model  limitation,  if 
any,  and  use  realistic  values  for  the  parameters  to  compare  the  experi¬ 
mental  data  with  the  theorical  calculations.  Furthermore,  optimization 
of  (1)  device  performance,  and  (2)  structural  parameters  for  device  ge¬ 
ometry  are  beyond  the  scope  of  this  investigation.  Although  a  scheme 
has  been  identified  for  optimization  in  general.  Furthermore,  using  the 
models  of  chapter  II  an  extensive  study  can  be  conducted. 

For  our  immediate  interest  in  device  characterization,  an  eigenvalue 
problem  (quantum  confined  carrier  energy  levels)  is  solved  numerically 
using  the  Newton- Rapson  iterative  technique.  Furthermore,  the  QW 
barrier  height  is  so  small  (even  in  the  conduction  band  discontinuty) 
it  does  not  permit  more  than  one  quantized  energy  level  in  each  of 
the  conduction  and  valence  bands  of  GaAs/AlGaAs  hetero  structure. 
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Therefore,  the  model  calculations  are  Umited  to  a  single  quantized  en¬ 
ergy  level  in  the  QWs.  Therefore,  the  behavior  of  one  of  the  most 
important  functions,  namely,  the  nonlinear  gain  (^m.*(n)),  is  not  ap¬ 
parent  from  these  calculations. 

Fourth  order  Simpson’s  rule  is  employed  for  numerical  integration 
of  the  gain  expression.  Notice,  for  a  given  QW  layer  thickness,  the  gain 
spectnun  is  shown  in  figure  (4).  Figure  (5)  is  basically  the  variation  in 
maximum  gain  as  a  function  of  injected  cmrrent.  From  this  preliminary 
calculation,  it  is  evident  that  an  abrupt  transition  (from  zero  gain  to  a 
finite  gain)  occurs  just  above  the  band  gap  energy,  exhibiting  the  low¬ 
est  energy  quantization  effect.  Furthermore,  a  little  above  the  abrupt 
transition  energy  level,  there  is  a  sharp  peak  representing  the  maximum 
gain,  with  bell-shaped  spectrum  of  the  system  in  general.  This  is 
followed  by  gain  harmonics  represented  by  multiple  peaks  of  relatively 
smaller  magnitude.  By  changing  the  well  size  (QW  layers  thickness  I,) 
the  gain  spectrum  reveals  a  slight  shift  in  energy  level  for  maximum 
gain.  Calculation  accuracy  is  essential  to  see  the  pronounce  shift  in  its 
energy  for  maximum  gain  conditions.  Although  the  linewidth  of  the 
gain  spectrum  is  very  narrow,  but  the  overall  profile  of  the  peak  looks 
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like  a  bell-shaped  curve.  It  is  also  evident  that  maximum  gain  is  three 
orders  of  magnitude  larger  than  its  baseline  effect. 

III.l  Threshold  Current  Calculation 

For  the  calculation  of  threshold  current  one  needs  to  solve  the  differ¬ 
ential  equations  with  the  boimdary  conditions  for  the  models  described 
in  chapter  II.  These  differential  equations  are  coupled  nonlinear  and 
nonhomogenous  with  multipoint  boundary  conditions.  In  general,  this 
requires  a  technique  similar  to  two  point  boundary  value  problems  to 
calculate  the  threshold  current.  The  Shooting  method  needs  to  em¬ 
ployed  to  solve  the  multipoint  boundary  value  problem  inconjunction 
with  the  Runga-Kutte  technique  to  solve  the  differential  equations. 

As  a  first  order  model  approximation,  the  injection  current  (source 
term)  is  used  to  establish  the  boundary  value  requirements  for  max¬ 
imum  free  electron  concentration  in  the  optical  cavity  under  the  in¬ 
fluence  of  ridge  waveguide  of  the  QW  structure.  Furthermore,  on  the 
outer  boundaries  of  the  optical  cavity,  intrinsic  carrier  concentration 
of  GaAs  is  assumed  as  boundary  conditions  imposed  on  the  system. 
These  realistic  assumptions  help  in  establishing  the  carrier  concentra- 
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tion  profile  in  the  active  region  to  estimate  the  sheet  current,  /,,  and 
diffusion  current,  Uiff.  In  our  investigation,  a  Gaussian  distribution 
is  assumed  for  the  carrier  concenration  in  the  active  region  of  optical 
cavity  along  i -direction  with  the  boundary  conditions  as  mentioned 
before.  Once  gain,  it  is  estimated  that  the  Auger  and  cavity  recombi¬ 
nation  currents  are  expected  to  be  no  more  than  20%  of  the  radiative 
recombination  current,  Jr.  However,  its  contribution  will  not  be  more 
than  2%  of  the  total  threshold  current,  i.e.,  threshold  current  due  to 
radiative  recombination  (including  the  cavity  losses)  and  nonradiative 
recombination  effects  /:  // 

III.2  Design  Optimization 

Neglecting  the  Auger  and  cavity  recombination  effects,  just  to  inves¬ 
tigate  the  structural  parameters,  we  could  approximate  the  expression 
for  threshold  current  as: 

hh  =  |7o(M).W  +  2/,|  r  (38) 

However  there  exist  a  relationship  for  optimal  cavity  length,  L*  as: 

In 

2(r{M)g„,.  -  o,) 
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where  £*  represents  an  optimal  optical  cavity  length.  Since  o,  <<  T{M)gmax 
and  considering  that; 


Jo{M)  =  MJo-sqw 


(39) 


we  have 


M 

“r(M) 


Jo- 


SQW 


2£.l 

M  (  ■ 


/n(l/i?  R2) 


2( 


I  nas 


(40) 


Jo-sQw  represnts  the  injection  current  in  SQW  layers.  This  expression 
shows  that  is  a  nonlineax  hmction  of  M,  r(M),  and  Furthermore, 

the  net  effect  of  /,  can  be  reduced  significantly  by  increasing  the  number 
of  QW  layers.  Fine  tuning  '■f  this  code  is  essentional  to  exploit  the 
power  of  this  model. 


6-36 


Chapter  IV 


Conclusions 

Analytical  and  computer  models  are  developed  to  estimate  the  in¬ 
fluence  of  each  parameter  for  the  calculation  of  the  threshold  current. 
However,  realistic  data  of  various  parameters  are  required  to  fully  un¬ 
derstand  their  effects  and  design  for  optimal  performance.  These  pa¬ 
rameters  can  be  summarized  in  the  following: 

(1)  confinement  factor, 

(2)  ridge  structure  parameters, 

(3)  QW  structure  parameters, 

(4)  role  of  leakage  current  componenets  on  the  threshold  current. 
As  an  example,  the  confinement  factor  is  a  function  of: 

number  of  quntmn  well  layers, 

QW  layer  thickness  of  the  active  region. 

X  :  Aluminum  fraction  in  the  barrier. 

Y  :  Aluminum  fraction  in  the  buffer  layers  for  gain  confinement. 
Li :  QW  barrier  width. 

Lg  :  Optical  cavity  width. 

\V  \  strip  width. 


6-37 
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h  :  thickness  of  active  material  in  the  MQWs. 

Regarding  the  ridge  structures,  one  needs  to  investigate  the  parameters 
associated  with  the  optical  cavity  and  numbers  of  QWs  as  listed  below. 

(i)  Optical  Cavity  Parameters 

This  is  once  again  function  of  the  following: 

X  :  Aluminum  fraction  in  the  barrier. 

Y  :Aluminum  fraction  in  the  buffer  layers  for  confinement  effect. 

:  QVV  barrier  width. 

Lg :  Optical  cavity  width. 

(ii)  Influnce  of  Well  Number  (M)  . 

In  this  case,  for  each  value  of  M  one  may  find  an  optimal  value  of 
L  =  L*(A/,  Lt)  corresponding  to  the  minimum  value  of  /i*.  As  a  matter  of 
fact  the  existence  of  this  optimal  value  A/*(L,)  for  a  given  L,  is  explained 
earlier  in  section  II.2,  ’’Design  Optimization”.  However,  investigation  is 
needed  for  the  threshold  ciurent  as  a  function  of  cavity  length  {Itk{L))  for 
a  given  value  of  I,.  For  parameter  optimization  of  QWs  see  steps  (3a) 
and  (3b). 

(3a)  Optimization  of  Well  Width  Lz  For  Specific  M 

By  repeating  the  calculation  of  step  (2)  for  a  different  I,,  a  set  of 
{M*{Lz),Lz]  pairs  can  be  obtained. 
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(3b)  Optimization  of  A1  Fraction  In  Cladding  Layers 

This  program  permits  for  variation  in  Al  fraction  Y  in  the  cladding 
(buffer)  layers.  However,  different  values  of  Y  could  lead  to  sensitive 
variations  for  the  optimal  threshold  current. 

Finally  role  of  leakage  current  would  shed  some  light  on  threshold 
current  requirements. 

(4)  Leakage  Current  Effects 

Leakage  current  components  such  as  {Idiff,  hiff,  huger,  hav)  can  be 
investigated  for  various  parameters  listed  earlier  using  this  computer 
code. 

(5)  Direction  For  Further  Investigations 

One  can  extend  this  work  to  model  the  entire  optical  switch  which 
consists  of  a  main  laser  and  two  side  lasers  once  the  optimized  geome¬ 
try  is  established  as  shown  earlier.  Before  this  can  be  done  some  fine 
tuning  of  the  computer  model  is  essential,  which  requires  structiiial 
parameters  and  experimental  data  to  verify  the  predicted  results. 
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Chapter  V 


Computer  Program  Listing 

Standrad  Fortran  77  language  is  used  to  develop  this  code.  How¬ 
ever,  this  code  can  work  on  Apollo  3500  or  DNK  10000  machine,  under 
Unix  operating  system.  Initially  this  code  was  developed  for  IBM  pc, 
which  requires  at  least  a  day  to  perform  the  calculation  for  gain  spec¬ 
trum  of  figure  (4).  Therefore,  this  code  is  transported  to  AppoUo  3500 
and  computational  time  was  significantly  reduced  to  3  hours.  However, 
about  one  hour  is  required  to  run  this  program  on  DNK  10000.  Further 
testing  of  this  software  is  required  to  make  sure  that  all  the  bugs  are 
out  and  program  is  efficient. 
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program  lasergarn 

implicit  reai*8 (a-h, o-z) ,  integer*4 ( i-ni 
real's  mo, k,me,mhh,mlh, Iz.men.mehx.mx 
character*!  aa(51) 

dimension  ee(4) ,hh(4) , eh(4) , egn( 4) , gte (200,4), 

1  tt (2, 512, 512) , xnuuc(2) , xmin(2) , afl (2) , xxnn(25) , xxpp(25) , 

2  aeven(2) , aodd(2) , sum(2) ,xxlz (10) ,gm(25,5), xjcav(25) , xjaug (25 ) , 

3  v{25) , xidif (25) ,xj (25) , xis (25) , xdum(25) , wwlz (6) , cj (25, 2) 
common  /a/pi,c,epo,uo,h,hb,q,mo,  <,me,mhh,mlh,er,  ego, vo, voo,  Iz 
common  /b/meh,mehx, enl 

common  /c/mx,evj,p2 
common  /d/ee,hh,ieh 

common  /e/xme,xmh,  hbl,  hbl2,  pil,  pil2 , x)ct ,  xlz,  ec,  ev,xnn,  xpp 

common  /f /tt, )c)c,inm 

common  /g/gte, xmax, xmin, aa 

common  / input/xlength, xwidth, x'<^, wlz, wlb, buffp, buffn, ddd, 

1  pdphot , rsheet, rrl, rr2, difc, alpi, cinj , xwd,xxnn,xxpp,xnpo,xnpi, 

2  xno,xpo,cinl,cin2,a,nwell,nnx 

This  file  is:  aqw. test. 3.1 
Updated  on  2/4/1992 

open(5,  files'  //heplOlc/user/jchaia/aqw.test.l  .in' ,  statuss'old' ) 
open(6,  files'  //heplO)c/user/lcha]a/aqw.tesr  .3  .l.out' ,  statuss'new' 

1  ) 

open (7 , files' //heplOk/user/)chaja/aqw. test .3 .1 .plot .out' , 

1  statuss'new') 

(56789012345678901234567890123456789012345678901234567890123456789012 

write(6,')' 

write(6,*)'  Source-code  file:  aqw. test. 3.1  ' 
write(6,')'  Output  f ile :aqw. test . 3 . 1 .out  ' 
write(6, *) '  ' 

xsO.dO 
refasref (x) 
xs0.2d0 
refbsref (x) 
xs0.40d0 
wlbs30.d-9 
wlgslOO .d-6 
refwsref (x) 
wwlz ( 5) s300 .d-9 
wwlz ( 4 ) s250 . Od-9 
wlz  (  3  )  s200  .d-9 
wwlz (2 ) sl50 .d-9 
wwlz ( 1) slOO .d-9 
voosO . 12d0 
VOS voo 
Cs300 
djs5 .d-5 
ICsO 
nweilsA 

l^Xsl 

in3ps25 

in^p  must  be  25 
nnxs25 
xnpis2 . Idl2 

xnposl.dS/ (50.d0*400.d0*1.6029d-19) 
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inwla3 

buf fp=200 .d-9 

do  900  nwssl.nwsize 

Izswwiz (nws) 

wlz=lz 

xiz=lz 

do  855  inweilsl, inwl 
nw«il=inwell*2 
dddsdfloac (nwcll) *lz 
cinjrO .dO 
Izpaiz'l .d9 
itesc=0 

write(6,*)'  Lz= ' ,  Izp,  '  [nm) ,  r.weil= ' ,  nwell 
wrice(6,*)'  Indax:  (GeiAs)  = ' ,  refa,  ' ,  (AlGaA)  =  ' ,  refb, 

1  (p)=',refw 

write(6,*''  Kinj)  I(th)  <Idif>  <Is>  <J> 

1  '  <gniod>  <Iaug>  <Icav>' 

i  456789012345678901234567890123456789012245678901234567890123456789012 
do  850  injsijx.injp 
itesc=icest>l 
cinjscinj+dj 
call  input 
ic=ic+l 

if ( itest .gt . l)go  to  25 
call  elece(ic) 

thicn=lz*df loat (nwall) ♦wlb*df loat (nwell-l) +buf fp 
refaves  ( lz*r*fa>wlb*r«fb-*'buf  fp'refw)  /thicn 
xnbarsl .do 
25  continue 

do  800  innal.nnx 
xnnsxxnn ( inn) 
xppsxnn 
xs.2 

xm«s£me(x) 
xznhs£hh(x) 
hblrhb 
hbl2=hbl*hbl 
pil=pi 

pil2=pil*pil 
taws 1 .d- 12 
xhtshb/taw 
xht2sxht*xht 
evj2sevj  *evj 
ev j 1 2  sdsqr t ( evj ) 

xconsts3d0*hb*q*q/  (xine*jan«*c*epo*xnbar ) 
xJttsJc't 

xznosl  .33  3  *mo*®go*evj 
xrmsxma'xmh/  (xnw^xmh) 
nsO 

XRiaxd)  s-1  .dlO 
xinin(  1)  si  .dlO 
xinax(2)  s-l  .dlO 
xniin(2)  si  .dlO 
50  nsn+l 

ecns®«(n) 
ecsego 
•vnshh(n) 

«vs0 .dO 

•gn ( n ) sacn^evn^ego 
agnnsagn(n) 
avj 1 savj 
aCcsa2inv(avjl) 
eCvsa4inv(av3l) 
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ascgnn 

al=a 

bl=l.SdO»a 
b2=100 .dO*a 

iO  iTuns2000 
Irani =niii»  I 

dex2=(b2-a)  /df  loat  (mini ) 
ijk=200 

dexl= (bl-al) /dfloat ( ijk) 

d«nlsee(n)  -^hhtn) 

xscsO .do 

xev=0 .dO 

do  200  i=l,ijk 

ephonsal'»-dexl*d£loaC  ( i-1) 

xps . OdO 

afl(l)=0.d0 

afl(2)=0.d0 

aeven(l) =0.d0 

aodd( 1) =0 .dO 

aeven ( 2 ) =0  .dO 

aodd(2) =0 .dO 

do  100  jrl.nanl 

ecvx=egn(n)  ■*'dex2*df  loat  ( j-1) 

if (ecvx.gt ,bl)  go  to  101 

xl=red(xnn,denl,  ephon, egnn) *evj 12 
eipcn=ecn+  (ecvx-egn(n)  )  *xnn/xn\e 
x2 1 sxmt e ( xmo , ecn , e ipcn ) 
x22sxintin(xmo.  acn,  eipcn) 
x3=fc|xrm.xin«,*cvx,egnn,xec,  efc.xkt ) 
x4s£v(xnn,xmh, acvx,  «gnn, xav, afv, xkt ) 
x5a (acvx-ephon) *evj 
x6ax5*x5+xht2 

xtesxl*x21* (x3-x4) *xht/ (x6*ecvx*evj ) 
xtinaxl*x22*  (x3-x4)  ‘xht/  {x6*ecvx*ev3  ) 
if  ( j  .eq.Dgo  to  68 
if (j .ne. Irani )go  to  70 

i8  afl{l)=afl(l)+xte 
af 1 (2) aafl (2) ^xtm 
go  to  80 

'0  if { j .na. j/2*2)go  to  75 
aeven  ( 1 )  =aevan  ( 1 )  4-xte 
aeven  ( 2 )  =aeven  ( 2 )  -^xtm 
go  to  80 

5  aodd(  1)  =aodd(  1) -t-xte 
aodd ( 2 ) =aodd ( 2 ) 

lO  continue 

if (xte. It .xp)go  to  100 
xpsxte 

lO  continue 

1  continue 

do  102  j=l,2 

suin(  j  )  =af  1  (  j  )  *4  .d0*aodd(  j  )  ♦2  .dO 'aeven  i  ■} ) 

2  3Uin(  ]  )  =3uin(  i )  •dex2*ev^  'xconst/  3  .  dO 
xxx=suin(  1) 

gte(i, 1) =ephon 

gta(  i ,  2 )  =suin(  1) 

gte(  i.  3  )  =suin(2) 

if  (XXX. la .xmaxd)  ) go  to  110 

xmax ( 1 ) axxx 

imaxlai 

go  to  115 
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110  if  (xxx.ge.xinin(l)  )go  co  US 
xmind)  =xxx 
115  continu* 
xxxssuni(2) 

if  (XXX.  le.xinax(2)  )go  co  120 
xmax(2) sxxx 
inuuc2si 
go  Co  200 

120  if  (xxx.ge.xinin(2)  )go  co  200 
xjnin(2)  =xxx 
200  continue 

xl=df loat (nwell) *wl2 
x2sdf loat (nwell-1) *wlb 
x3sxl•^x2 

avrnus  (xl*refa+x2*refb)  /x3 

gaina«2.d0*pi*pi*{x3**2)  *  (avniu»-2-refw**2) 

xleunbshb*c/  (gte(  imaxl,  1)  *evj  ) 

g^unasg^uoa/xlamb/xlamb 

c  f =gaina ‘xl /x3 /avinu 

xll=wlz 

xw=vo 

if { inn.gt . 0 ) go  co  430 
write(6, *) ' 

write(6,*)'  *•*  gain  spectrin  '*•  ' 

wriCe(6. *) ' 

write(6,*)'  Lz=',xll,'  [m]  ,  Vc=',xw.  '  {eV]  * 

write (6,*)’  n  =',xnn,'  [m3],  p  =',xpp, '  [m3]' 

write(6,*)'  T=',c,'  [oK] ' , ' ,  -well', nwell 

do  40C  i=l,ij}c  . 

gte(i,2)agte(i,2) *cf 

gte(i,3)=gte(i,31 *cC 

if (inn.gt. l)go  co  400 

write (6, *)gte( i.l) , gte ( i, 2) , gtei i, 3 ) 

400  continue 
ktemsl 
imeucs  imaxl 

420  call  plot ( i jk, ktem, imax) 
write(6, *) ’  ’ 
ktems2 
imaxsiinax2 

call  plot (ijk, ktem, imax) 
write (6, *) '  ' 

430  continue 

xppsxxpp ( inn ) 
e£v=a4inv(evj 1) 
v(  inn)  sefodabs  (efv)  -fego 
cj (inj, l)=cinl*1000.d0 
cj (inj,2)=cin2*1000.d0 
gm(  inn.  1)  =gte(  im^ucl,  I) 
gm  ( inn,  2 )  =gte  ( inuutl ,  2 ) 
gm(inn, 3 ) =gce( imaxl, 3 ) 

300  continue 

3 456789012345678901234567890 123 4567390 122 45678«01234567890123456789012 
xsO .do 

xidif ( 1 ) =0 .dO 
xis ( 1) sO .dO 

xxlsgm(2. 1)  **2«-gm(3, 1)  ••2 
xxadsqrc (xxl) 

xo  ( 1)  *q*(pdphot*xx/  (gm(  1, 1)  'ev:  '  ♦ixxnnd)  -.xnpi)  /  caw)  'ddd 
do  810  is2.nnx  6'44 

X3X4'Xwd 

xxlag»( i. 2) ••2*gm( i, 3 ) **2 


xxadsqrc (xxl) 

spnapdphoc'xx/  'gin(  i,  1)  •evi ) 
xrnatxxnni i) -xnpi) /taw 
parg=2 .dO* (a/xw) ••2 
argsparg'x'x 
xnxo=xno*dexp( -arg) 
xrvxsxnxo'vxnpi 
dxnxsxnx* ( -parg'x) 
d2xnxsxnx*parg*x* (parg*x-l .dO ) 
xj  (  i)  =q*ddd*  (difc*d2xnx+xrn-t-spnj 
xidiC ( i ) =-q*difc*dxnx 
0  continue 
xis ( 1) =0 .dO 
nnxl=nnx-l 
do  812  i=2,nnxl 

x6=(v(i+l)-v(i-l))/ (2 .dO*xwd*rsheet ) 

2  xis(i)=-x6 
i=nnx 

xis(i)=-(v(i)-v(i-l) )/ (xwd*rshe6C) 
do  813  i=l,nnx 

xjaug( i) =xxnn( i) ‘xno'xno* (2 .d-42 ) ‘q'df loat (mwell) ‘Iz 

3  xjcav(  i)  =q‘'wlg*2  .dO*  (9  .d-17 )  •  (xxpp  (  i )  +xnpo)  *xnpi*xnpi 
1  /xxpp (i) *9 .d-17 

xxl=xwidth* . 5d0 

call  tsum(xidi£,xwd,xxl, val) 

avedi£=val 

va2=val*l .d-4 

xxl=xwidth*0 . 5d0 

call  tsuni(xj  ,xwd,xxl,  val) 

avexjsval 

xxlsxwidth* . 5d0 

call  tsutn(xi3,xwd,xxl, val) 

avexissval 

xi tha>  ( avex  j  'xvidth-*- 2 .  dO  *avedi  £  *ddd )  •  1 . 2d0  *xlength 

xithc=2 .dO*avexis*xlength 

xith=xitha+xithc 

do  830  i=l,nnx 

xxsgmi  i,  2)  **2+gin(  i,  3 )  **2 

xxsdsqrt (xx) 

0  xduffli i) =xx*xxnn( i) 
xxlsxw 

call  tsum<xduin,xwd,xxl,  val) 
aveg= va 1 • nwe 1 1 
do  840  i=l,nnx 
3  xduin(i)  =xxnn(i) 

call  t3uin(xduin,xw,xxl,  val) 

avennsval 

gmodsaveg / avenn 

xxl®xwidth*0 . SdO 

call  tsuin(xjaug,xwd,xxl,  val) 

avjaugsval 

call  csuinix3cav,xwd,xxl,  val) 
av^ cavsval 

c j  cavsavj  cav'wlg "xwidth 

ciaug=av]aug'wlz*d£loac  {ttn<feil )  *x length 'xwidth 
rrsrri*rr2 
rral .dO/rr 

oplen>2.d0*dlog(rr) /gmod 
cin=cin7  *1000 .dO 
cthaxith'lOOO .do 
cdi£3avedi£*l .d-4 
cisaavexi3* . OldO 
c^deavexi •! .d-4 
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caugsciaug'l .d5 
ccavsc^  cav* 1 . d9 

write!  6, 342)  cm,  cch.  cdif ,  cis,  c:d,  gmod.caug,  ccav 
142  forraac  (2x,  fS  .2,  :gll .  4, 2£6 . 3  ) 

150  continue 
155  continue 
>00  continue 
100  close(5) 
close! 6) 
close!?) 
stop  555 
end 

subroutine  tsun!xx,xdx,xl. val) 

implicit  real's (a-h, o-z) ,  integer*4 ! i-n) 

dimension  xx!25) 

xf=xx!l)  >oc!25) 

xoddsO  .do 

xevensO .dO 

do  20  i=2,24 

if {i.eq.i/2*2)go  to  10 

xoddsxodd-t-xx !  i ) 

go  to  20 

10  xeven=xeven+xx! i) 

20  continue 

vals  !xf  •♦•4  .d0*xodd+2  .dO'xeven)  *xdx/3  .dO 

val=val/xl 

return 

end 

subroutine  £ill!ii) 

implicit  real's (a-h, o-z) ,  integer'4 (i-n) 
character'!  aa 

dimension  gte!4,200) ,xmax!2) ,xmin!2) ,aa(51) 
coinnon  /g/gte,xmax,xmin,aa 
do  20  i=l,51 
20  aa(i)='  ' 

aa!l)='  :  ' 
do  30  i=2, ii 
30  aa(i)  =  '" 
return 
end 

subroutine  input 

implicit  reai'8(a-h,o-z) ,  integer'4 ! i-n) 
dimension  xxnn!25) ,xxpp(25) 

comnon  / input /xlength, xwidth, xw.wlz.wlb, buff p, buff n, ddd, 

1  pdphot, rsheet, rrl, rr2,difc, alpi, cinj ,xwd,xxnn,xxpp,xnpo,xnpi, 

2  xno,xpo,cinl,cin2,a,nwell,nnx 

456789012345678901234567890123456789012345678901234567890123456789012 


tn)ta  units  are  used 

xlength=200 .d-6 
xwidth=80 .d-6 
xwsxwidth'O . 05d0 
wlb=50d-9 
pdphotslO .d5 
rsheetsSO .dO 
rrl3.85 
rr2a.3 
difcal .d-3 
alpislOO .dO 
taws 1 .d- 12 
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q=1.60219d-13 
asO .2d0 

xnpi  =  QW  ir.crinsic  acrrier  desicy  [per  m  cube) 

;<npo  =  p-Dutfer  zone  doping  density  [per  m  cube) 

caw  =  acrrier  life  time  [s] 

q  =  electronic  charge  [Q] 

cm;)  =  injected  current  for  lasing  action  (A) 

xlengchs  laser  cavity  length  [tij 

xw  =  width  [.Til 

xwidth  =  total  width  laser  chip  [m] 

nwell  =  number  of  qws 

wlz  =  qw  width  [.n] 

wlb  =  barrir  width  [rn] 

buffp  =  p-type  buffer  thickness  [m] 

buffn  =  n-type  [m] 

pdphot  =  optical  power  density  [watts/m  square) 

rsheet  =  p-type  buffer  sheet  resistance  [ohms-m] 

rrl  £«  rr2  =  mirror  reflectivity 

difc  =  diffusin  constant  (m  square/s] 

alpi  =  cavity  losses  [per  m) 

cinl=cinj 

xjrcinl/ (xw'xlength) 
adsddd 

xno=taw*xj/ (q*ad) 
xwd=xw*0 . 5d0 
xn=xwidth/xwd 
nnx=int  (xn)  -t-l 
if (nnx.gt .25) nnx=2S 
xwd=xwidth/df loat (nnx) 
x=0 .do 
xw2=xw* .5d0 
do  20  isl,nnx 
xsx-fxwd 

if (x.gt.xw2)go  to  10 
xxnn  ( i )  sxnpi-f  xno 
xxpp  ( i )  axnpxj-xno 
if (xpo.gt.xnpo)xxpp(i) =l.d0 
go  to  20 
xx=a*x/xw 
xx=xx**2 
x2=xno*dexp( -XX) 
xxnn ( i ) =xnpi+x2 
xxpp ( i ) =xnpo-x2 
if (xpo.gt . xnpo ) xxpp ( i ) =1 .dO 
continue 
do  30  i=l,nnx,3 
jl  =  i 
j2=jl+2 

if ( j2.gt.nnx) j2=nnx 
write (6, *) (xxnn( j ) , j=jl, j2) 

write(6,*)'  hole  density  (  m  i.nverse  cube)  ' 
do  33  i=l,nnx,3 

n=i 

]2=)1>2 

if ( j2 .gt .nnx) j2=nnx 
write(6, *) (xxpp( j) , j=jl, )2) 
re-urn 
end 

subroutine  plot  ( i  jlc,  ktem,  imax) 
implicit  reai*8(a-h,o-z) ,  integer'4 ( i-n)  5^7 
dimension  gtet 200, 4 ) ,  aa( 51)  ,  :<max( 2)  ,  :aninl2) 
character *1  aa 


conmon  /g/gte,xsiax,xmin,aa 
if  (xinaxtkcem)  )  5, 600, 5 
5  if  ()ccem.eq.2)  go  zo  10 
write { 6 . • ; ' 

wrice(6,*)'  qw  gain  vs  photon  energy  plot  **" 

write(6,*)'  *•*  transfer  electric  spectrum  •**< 

write(6,*)'  Max=' , xmaxt 1) , ' ,  Indx=',imax 

write(6,*)'  Min= ' , xmin ( 1) 

write ( 6, • ) ' 
iksktem-f  1 
go  to  20 
.0  write(6,  *)  ' 

write(6,*)'  •••  qw  gain  vs  photon  energy  plot  •**' 

write<6,*)'  •••  transfer  magnetic  spectrum 

write(6,*)'  Max= ' , xmax ( 2 )  Indx=',iroax 

write(6,*)'  Min=‘ ,xmin(2) 

write(6,*)' 
ik=ktem+l 
,0  continue 
ial=imax-25 
if ( ial . It . 1) ial=l 
ia2sinuuc-*-25 
if ( ia2 .gt. i jk) ia2=ijk 
xxsxnvax(ktem) 

,56789012345678901234567890123456789012345678901234567890123456789012 

write(6,*)'  E(ph)  g(E)  .  Plot  •***' 

write (6, •) '  ' 
do  500  is=ial,ia2 
xesgteds,  ik) 

xdsxe/xx*S0.  , 

ixsint(xd)-*-l 
if (xe.lt. xx2) go  to  500 
if (ix.gt.51) ixsSl 
call  fill(ix) 

write (6,505) gte ( is , 1 ) , xe , aa 
0  continue 

5  fonnat(2x, f7 .3,gl2.4, 2x, 51al) 
return 

0  write(6, 605) xmax (ktem) 

5  fonnat(2x,'  *•  zero  maximum  gain  (',g20.5,')  *•') 

write(6,*)'  ' 
return 
end 

real's  function  red(xrm,d, e, egn) 
implicit  real*8(a-h, o-z) ,  integer*4 ( i-n) 
real*8  mo, k,me,mhh,mlh, lz,meh,mehx,mx 

connon  /a/pi, c, epo, uo, h, hb, q, mo, k,me,mhh,mih, er, ego, vo, voo, Iz 
pi2=pi*pi 
hb2shb*hb 
x=a-egn 
if (x)20, 10, 10 
0  xl=x/d 

x2=dsqrt (xl) 

d2=dsqrc (d) 

x3=xrm'2 .d0/hb2 

x32=dsqrt (x3 ) 

x33sx32*x32*x32 

xxax33*d2*inc (x2) / (pi2*2 .dO) 

redsxx 

return 

D  write(6,*)'  a»',a.  '  and  egn*', egn 

write(6,*)'  X3',x,',  :<1  <  0  in  red  routine' 
stop  100 


e 

4 


end 

real  3  function  xmcmlxmo,  ecnl,  eipcr.j 
implicit  reai*9  (a-h,  0-2 )  ,  integer*'!  ( i-n) 

common  /e/xme, xah, hbl , hbl2, pil, pii2 , xkt , xiz,  ecn, ecv, xnn, xpp 

x=xino*  ( 1  .dO-ecnl/eipcnj  *1 .  SdO 

xiatmsx 

return 

end 

real*8  function  xrote (xmo, ecnl, eipcnj 
implicit  real'8 (a-h, 0-2) ,  integer*4 ( i-n) 

connnon  / e/xme, xmh.hbl  hbl2 , pil , ril2 , xkt ,  xlz,  ecn, ecv. xnn, xpp 

x=xmo* ( 1 .dO+ecnl/eipcn) * .7 SdO 

xmte=x 

return 

end 

real*8  function  a2inv(ev3) 
implicit  real*8 (a-h, 0-2 ) 

common  /e/xr‘!,xmh,hbl,hbl2,pil , pil2 ,  xkt , xlz.  ecn.  ecv,  xnn, xpp 

x=xnn*pil*hbl/xme*hbl/ev]  /xkt*xl2 

if (X. It . 0 . icO)go  to  10 

y=dexp(x) -1 .dO 

ef c=ecn+rkt*dlog (y ) 

a2inv=efc 

return 

y=x 

j  =  l 

do  20  jj=2,20 

x=x*x 

yl=y 

y=y+’X/dble(  j ) 
dy*y-yl 

if (dy . le.y*l .d-3 )go  to  25 
continue 

ef«_=ecn+xkt*dlog(y) 

go  to  5 

end 

real*8  function  a4inv(evj) 

implicit  real*8(a-h,o-2) ,  integer*4 ( i-n) 

common  /e/xme, xmh, hbl,  hbl2, pil, pil2, xkt ,xlz,  ecn, ecv, xnn, xpp 

x=xpp*pil*hbl/xinh*hbl/ev3  /xkt'xlz 

if (X. It . 0 . Id0)go  to  10 

y=dexp(x) -1 .dO 

efv=ecv+XKt*dlog(y) 

a4inv=efv 

return 

y=x 

j  =  l 

do  20  jj=l,20 

j=j*j  j 

x=x*x 

Yl--/ 

y=y*x/dble ( j ) 
dy^y-yl 

if (dy. le.y'l .d-3)go  to  25 
cont  ir.ue 

efv=ecv>xkt •dlog(y ) 

go  to  5 

end 

real*8  function  xintedx.it) 
implicit  reai*8(a-h,o-2) ,  integer*4 ( i-n)  6^9 
dimension  tt (2, 512, 512) 
common  /f/tt,kk.mm 


1  =  XX 

if ( i .ne . 1 ) go  co  7 
do  5  3  =  1,. Tan,  5 

5  -vriceie,*)'  tr  1 ' ,  3 ,  ' ,  1  )  =' ,  ct !  i*l,  3 , 1) 

'■  continue 

do  20  3  =2, nan 
3X=4**  f j-1) 
f jx=df loan ( jx) 

3XX=3X-1 

f jxxsdf loat ( jxx) 

3l=j-l 

7:=: 

do  10  k=jj,mm 
klrkfl 

tc (i, j j , k) =( f jx*tt (i, jl,kl)-tt(i,3l,k))/f3XX 
10  continue 
20  continue 

if (ix-l)50,30,50 
30  ial=it-10 

if ( ial . le . 0) ial=l 
ia2  =  it-»-10 

if ( ia2 .gt .mm) ia2=mm 

write(6,*)'  ••  integrated  values 

do  40  j=ial, ia2,3 

40  write(6,*)'  xinte:  ' , tt ( i, j , 3 ) , tt ( i,  j  +  1.  j  +  l) , 

1  tt  (i,  j  v2,  j-*-2) 

50  continue 

xintestt (i.mro.mm) 

return 

end 

real*8  function  dfloat(i) 

implicit  real*8(a-h,o-z) ,  integer*4(i-n) 

ixsi 

d£loat=dble(ix) 

return 

end 

real*8  function  fc (xtrn, xme, ecvx, egnn, ec, efc.xkt) 

implicit  real*8(a-h,o-2) ,  integer*4 ( i-n) 

xsxrm/xme* (ecvx-egnn) /xkt 

y= (efc-ec) /xkt 

if (x.gt.l.d5)go  to  10 

xl=dexp(x-y) 

x2=l.d0/  (l.dO-fXl) 

5  fc=x2 
return 
0  x2=l.d-5 

go  to  5 
end 

real*8  function  fv(xrra,xmh, ecvx, egnn, ev.efv, xkt) 
implicit  real*8(a-)T,o-z)  ,  integer'4  ( i-n) 
xx=ecvx-egnn 
X =xx • xrm/ xmh / xkt 
yy=efv-ev 
y=yy/xkt 
xx=-x+y 

if (xx.gc . 1 .d5)go  to  10 
xl=dexp( -x+y) 
x2=l.d0/ (l.dO^xl) 

5  fvax2 
return 
10  x2»l.d-5 

go  to  5 
end 
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subroutine  eiecedcj 

implicit  real*? (a-h, o-z) ,  integer'i d-nj 
reai'3  mo,  ic,me,mhh,mih,  Iz.raeh.menx.mx 

common  /a/pi,  c,  epo,  uo,  h,  hb,  q,  rr.o,  k.me.  mhh.mih,  er,  ego,  vo,  voo,  iz 

common  /b/men,mehx, enl 

common  /c/mx, evj ,  p2 

dimension  ee ( 4 ) , hh( 4 ) , eh ( 4 ) 

common  /d/ee,hh,il 

This  part  is  from  file  qwee 
as  of  1/4/1992 


if (ic.gt.l)go  to  10 

pi=3 .141593d0 

p2=pi+pi 

c=2.997925d8 

epo=8.8542d-12 

uo=l .26d-6 

h=6.6262e-34 

hb=1.05459e-34 

q=1.60219d-19 

mo=9.1095d-31 

evj=1.60219d-19 

mx=mo 

k=8.62d-5 

units;  elm],  epo{f/m],  uolh/m],  h(J.sl,  hb(J.s] 
q(C],  mo(kg],  evj (conversion  from  ev  to  J), 
k[eV/K  degrees] 


ieesO 

ieosO 

inumsl 

me=.0665*mo 

mhh= . 3  4 ‘mo 

mlh=.094*mo 

er=12.9 

ego=l . 424 

wo=0 .56 

il=2 

i2=il 

vo=voo*evj 
xx=0 . 

mehsfme (XX) 
xx=.2 

mehxsfme (XX) 

wl=vvo 

in=l 

if ( ic .eq. 1 ) go  to  15 
do  14  ^=l,il 
eh ( 3 ) =ee ( j ) * . 85*evj 
continue 

call  energy  (wl,  i2,  eh,  in,  iee,  leo, 
if  (  ic.eq.  1)  il  =  iee-*-ieo 
do  20  i=l,ii 
ee( i) -eh( i) /ev^ 

XXaO .dO 
mehafhh(xx) 
xx3.2d0 
fflehxsfhh(xx) 

i2sil 

•/v2-:i  .dO-wo 


mum,  1C) 


22 


in=i2 

if (ic.gc.l)go  zo  22 
ihesiee 
ihosieo 
1 nuns 2 

if ( ic .eq. 1) go  to  25 
do  24 

4  eh( j ) shh( j ) * . 85*ev3 

5  continue 

call  energy  (w2.  i2,  eh,  in,  ihe,  ir.o,  mum,  ic) 
do  30  isl,il 
30  hh(i) seh(i) /ev^ 
if ( ic.gC.O) return 
do  80  i=l,il 
30  write(6, *) 

1  '  E(',i,',  e)  =  ',ee(i) 

do  90  isl,il,2 
90  write(6, *) 

1  '  E( ' , i, ' ,  h)  =  ' ,hh(i) 
return 
end 

subroutine  energy  (wo,  ii,  eh,  in,  le,  io,  inuro,  ic) 
implicit  real*a (a-h, o-z) ,  intecer*4 ( i-n) 
real's  mo,  )t, me, mhh, mlh,  l2,meh,r.ehx,mx 

common  /a/pi, c, epo, uo, h, hb, q,mo, <,me,rahh,mih, er, ego, vo, voo, Iz 

common  /b/meh,mehx, enl 

common  /c/mx,ev^,p2 

dimension  eh(4) 

xp2s( (pi*h/lz) **2) *0 . 5/meh 

testsO . IdO 

indxsl 

2  continue 

Calculations;  electron/hole  energy  for  odd  number 
do  10  isl,ii,2 

enls( (dble(i-l) *pi*h/lz) **2) *0. 5/meh 
en2s( {dble(i) *pi*h/lz) **2) *0. 5/meh 
desen2-enl 
enlsenl+de*0 . 85d0 
en2senl 

if ( ic.gt . 1) en2seh( 1) 
vmswo'vo 
ixsO 
itestsO 

5  ixsix+1 

if ( ix.gt ,999)go  to  22 
enlsen2 

call  qnl(wo,  vl,v2,  itest) 
if (itest .ne.0)go  to  110 
if (v2) 6,27, 6 

6  dens-vl/v2 
en2senl^den 
if  (en2  .  It  .’/m)go  to  101 
en2sv7n* .  95d0 

101  continue 

if ( ix. le. 2)go  to  5 
d2sdaba (enl'test) 
if (dabs(den) .gt.d2)go  to  5 
eh( i) 3en2 
go  to (7, 10) , inua 

7  if  (  ic  .gt .  Dgo  to  10 
iosiO'f'l 
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go  CO  10 
eh(i) aenl*.95d0 
go  CO  27 
10  continue 

wrice(6,*)'  ix=',ix, '  and  eh ( i i  =  ' , en2 ,  '  for  odd  i' 

if ( io.gc . l)go  to  12 

de=vni-en2 

Test  for  difference  in  barrier  height  and  electron 
energy  level  computeed  just  now 

if  (de.  le.vin'0 . 1)  return 

12  continue 
indxsl 

go  to  (14,13), inun 

13  if ( ie.eq. 0) return 

14  continue 

Calculations:  electron/hole  energy  for  even  number 
do  20  i=2, ii, 2 

enl= ( ( (float ( i-1) *pi) *h/l2) »*2) *0 . 5/mehx 

enusenl-fxp2 

en2aenl+xp2* . 85d0 

if ( ic .gt . 1) en2aeh(2) 

ix=0 

itestsO 

15  ix=ix+l 

if (ix.ge.999)go  to  30 
enlaen2 

call  qn2  (wo,  v3,  v4,  itest) 
if (itest.ne.O)go  to  120 
if (v4)16,30,16 

16  dens-v3/v4 
en2senl-*-den 

if (en2 . It .vm)go  to  102 
en2=vin* .  95d0 
02  continue 

if ( ix. le.2)go  to  15 
d2=dabs (enl*test) 
if (dabs(den) .gt.d2)go  to  15 
eh( i) =en2 
go  Co( 17 , 20) , inum 

17  if ( ic.gt . 1) go  to  20 
iesie-fl 

go  to  20 

20  eh(i)=enl*.95 

go  to  32 
20  continue 

write(6,*)'  ixa',ix, '  and  eh( i) =' , en2, ' ,  even  i' 
return 
22  continue 

write(6,*)'  Out  of  loop  10  ' 
go  CO (26, 27) , indx 
go  CO  27 
26  indxa2 
go  CO  2 
>7  il  =  i 

eh( il) =en2 

if  (  inum.eq.  1 )  ioaiO'f  1 
return 

10  go  CO (31, 32) , indx  6-53 

go  CO  32 
indxa2 


11 


go  CO  14 
-.1  i2=l 

eh( i2) =en2 

if ( inum.eq. 1) ie=ie+l 

return 

end 

subroutine  qnl  ( '/vo,  f ,  df ,  itest ) 
implicit  reai'9 (a-h, o-z) ,  integer *4 ( i-n) 
real*8  mo.  k.me.mhh.mlh,  l2,meh..'r.ehx,mx 

common  /a/pi, c, epo, uo, h, hb, q.rno, k.me.mhh.mlh, er. ego. vo. voo,  Iz 
common  /b/meh.mehx. enl 
common  /c/mx.ev].p2 
ar=2 .dO*meh*enl/h/h 
asdsqrc (ar) 
ar=meh/enl* . 5d0/h/h 
dasdsqrc (ar) 
vx=vvo*vo-enl 
if (vx) 20. 20. 10 
10  ar=2  .dO*mehx*’/x/h/h 
b=dsqrc (ar ) 
ar=mehx/2 . /vx/h/h 
db=-dsqrc (ar) 
ar=a*lz*0 . 5d0 
alsdsin (ar ) 
a2=dcos (ar ) 
a3=a2/al 
f  =a*a3-»-b 

df=da* (a3-ar/al/al) +db 
itesCsO 
return 
20  vxaenl 

write(6,*)'  enla'.enl.',  wos‘,\’vo.',  andvo=*,vo 

write(6,*)'  vxs'.vx,'  <  0  in:  qnl' 

itescsicest-»>l 

return 

end 

subroutine  qn2(wo,  f  ,df,  itest) 
implicit  real*8(a-h,o-z) ,  integer*4 { i-n) 
real*8  mo, k.me.mhh.mlh, Iz.meh.mehx.mx 

common  /a/pi, c, epo, uo, h, hb,q,mo, k.me.mhh.mlh, er, ego, vo,  voo,  Iz 
common  /b/meh,mehx, enl 
common  /c/mx,evj,p2 
ar=2 .d0*meh/h*enl/h 
asdsqrt (ar) 
ar=meh/enl* . 5d0/h/h 
dasdsqrt (ar) 
vx=vvo*vo-enl 
if (vx)20.20, 10 
10  ars2 .d0*mehx*vx/h/h 
badsqrt (ar) 
ar=mehx/2 . /vx/h/h 
db=-dsqrt (ar) 
araa*l2*0 . 5d0 
alsdsin(ar) 
a2sdcos (ar ) 
a3=al/a2 
f=a*a3-b 

df  sda*  (a3'*'ar/a2/a2)  -db 

itestaO 

return 

20  vx*enl  6-54 

write(6,*)'  enla'.enl. vn/oa'  '.-vo,  ' ,  andvoa'.vo 
write(6,*)'  '.xa'.'/x,  '  <  0  in:  cni' 


itestsitest'fl 

return 

end 

real*8  functicn  fme(x) 

implicit  real's (a-h, o-z ) ,  inceger'4 (i-n; 
real'8  mo, k.me.mhh.mlh, lz,meh,mehx,mx 

common  /a /pi, c, epo, uo, h, hb, q, mo, k, me, mhh.mlh, er, ego, vo, voo,  Iz 

common  /b/meh,mehx. enl 

common  /c/mx,ev3,p2 

fmea ( . OSSSdO* . 083d05*x) 'mo 

return 

end 

real's  function  fhh(x) 

implicit  real*8(a-h,o-z) ,  integer'4 ( i-n) 
real's  mo, k,me,mhh,mlh, lz,meh,mehx,mx 

common  /a/pi, c,  epo,  uo, h, hb, q,mo, k,me,mhh,mlh,  er, ego, vo, voo, Iz 

common  /b/meh,mehx, enl 

common  /c/mx,evj,p2 

fhh=( .034d0*.175d0'x) 'mo 

return 

end 

real's  function  flh(x) 

implicit  real's (a-h, o-z) ,  integer*4 ( i-n) 
real's  mo, k,me,mhh,mlh, lz,meh,mehx,mx 

connnon  /a/pi, c, epo, uo, h, hb, q, mo, k,me,mhh,mlh, er, ego, vo, voo,  Iz 

common  /b/meh,mehx, enl 

common  / c /mx , ev j , p2 

flh=( .094d0+.069d0'x) 'mo 

return 

and  t 

real's  function  re£(x) 

implicit  real's (a-h, o-z) ,  integer'4 ( i-n) 

ref *3 . 590-0 . 170'x+0 . 091'x'x 

return 

end 
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Appendix  A 


Nonradiative  Recombination 
Leakage  Current-Auger  Effect 


4 


An  electron-hole  pair  can  recombine  nonradiatively  meaning  that 
the  recombination  can  occur  through  any  process  that  does  not  emit 
a  photon.  In  many  semiconductors,  for  example  pure  germanium  or 
silicon,  the  nonradiative  recombination  dominates  radiative  recombi¬ 
nation,  The  effect  of  nonradiative  recombination  on  .the  performence 
of  injection  lasers  is  to  increase  the  threshold  current.  If  r„r  is  car¬ 
rier  lifetime  associated  with  the  nonradiative  process,  the  increase  in 
threshold  current  desity  is  approximately  given  by: 

•Inr  ~  (Al) 

where  is  the  carrier  desity  at  threshold,  d  is  the  active  layer  thickness 
and  q  is  the  electron  charge. 

A.  Auger  Effect 

Since  the  poin«  nng  work  by  Beattie  and  Landsberg  [33],  it  is  generally 
accepted  that  Auger  recombination  can  be  a  major  nonradiative  mech¬ 
anism  in  anrrow-gap  semiconductors.  Recent  attention  to  the  Auger 
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effect  has  been  in  connection  with  the  observed  higher  temperature  de¬ 
pendence  of  threshold  current  of  long-wavelength  InGaAsP  lasers  com¬ 
pared  to  short  wavelength  GaAs/AlGaAs  lasers.  It  is  generally  belived 
that  the  Auger  effect  plays  a  significant  role  in  determining  the  ob¬ 
served  higher  temperature  sensitivity  of  threshold  current  of  InGaAsP 
lasers  emitting  near  1.3/im  and  l.SS/xm  (34,35).  In  quantum  well  struc¬ 
tures,  the  modification  of  the  density  of  states  may  reduce  the  Auger 
recombination  rate  to  allow  long- wave-length  QW  lasers  to  have  a  low 
temperature  sensitivity  [36,37]. 

In  the  following,  formulation  for  Auger  rate  calculation  in  QW  is 
outlined.  The  various  band-to-band  Auger  processes  are  shown  in  fig¬ 
ure  (.41).  These  proceses  are  labeled  CCCH,  CHHS,  and  CHHL  in 
which  C  stand  for  the  conduction  band,  H  for  the  heavy-hole  band,  L 
for  the  light-hole  band  and  S  for  the  spin  split-off  band.  The  energy 
versus  wavevector  (k^^ky)  diagram  is  shown  for  one  subband  (one  state 
in  the  z-direction)  in  each  of  the  bands.  The  dashed  curve  (for  CCCH) 
shows  a  second  subband  in  the  conduction  band.  The  dashed  line  shows 
a  process  in  which  the  electron  makes  a  transition  to  a  different  sub¬ 
band.  The  CCCH  process  involves  three  electons  and  a  heavy  hole  and 
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is  dominant  in  n-type  material.  The  CHHS  and  the  CHHL  processes 
are  dominant  in  p-type  material.  Under  high- injection  conditions  as  is 
present  in  lasers,  all  of  the  above  mechanisims  must  be  considered. 
However  the  Auger  recombination  rates  for  the  CCCH,  CHHS  and 
CHHL  processes  have  been  previously  calculated.  In  the  nondegerate 
approximation,  the  Auger  rate  R  for  the  CCCH  process  -/aries  as: 


with 


exp(^-AEJkBTy  (A2) 


AEc  - - - , 

m»  +  2mc  —  met 


(A3) 


* 


where  n,  p  are  the  electron  and  the  hole  concentrations  respectively, 
is  the  effective  masses  of  the  electron  at  the  higher  energy  E|  (electron 
2’  in  figure  (Al)  and  me,  are  the  band  edge  effective  masses  of  the 
electron  and  hole,  respectively.  is  the  energy  different  between  the 
lowest  conduction- band  subband  and  the  highest  heavy-hole  subba^d. 
For  most  cases,  E^=Eg  (band  gap).  The  strongest  temperature  depen¬ 
dence  and  the  band  gap  dependence  of  the  Auger  recombination  rate  is 
evident  from  the  above  equation.  The  Auger  rate  increases  rapidly  with 
increasing  temperature  and  with  decreasing  band  gap.  For  the  CHHS 
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and  CHHL  processes,  the  Auger  rate  in  the  nondegerate  approximation 


varies  as: 

~  •  n  •  expi^-C^EclksT^, 


(A4) 


with 


Ai) 

rrxc  +  2m»  —  m,  ’ 


(A5) 


for  CHHS,  and 


A£=  , 

me  +  2m„  —  mi 


(A6) 


for  CHHL,  where  m,,m/  are  the  effective  masses  of  the  split-off-band 

s 

and  light  hole,  respectively,  and  Ai  is  the  energy  difference  between  the 
top  of  the  heavy-hole  band  and  that  of  the  split-off  band. 


For  undoped  semiconductors,  such  as  the  active  region  of  the  semi¬ 
conductor  laser,  the  Auger  rate  at  an  injected  carrier  density  n  is  given 
by: 

R  =  Cn\  (A7) 

where  C  is  known  as  the  Auger  coefficient.  The  Auger  lifetime  is 
given  by: 

U=n/R=-^-  (.M) 

C 
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Thus  the  increase  in  the  threshold  current  due  to  Auger  recombination 
can  now  be  obtained. 
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1 .  Abstract 


A  technique  is  described  to  equalize  the  optical  path  lengths  for 
a  multipath  fiber  optic  distribution  network.  A  novel  path  length 
adjuster  is  fabricated  for  a  four  channel  system  and  used  to 
provide  continuous  adjustment  of  the  path  lengths.  Experimental 
data  is  presented. 

2.  Introduction 

Phased  array  antennas  play  an  important  role  in  today's  high 
performance  communication  and  radar  systems.  Optical  components, 
because  of  their  light  weight,  wide  bandwidth,  and  immunity  to 
electromagnetic  interference,  are  expected  to  play  a  major  role  in 
future  phased  array  systems.  This  report  addresses  several 
problems  that  may  be  encountered  in  the  implementation  of  a  phased 
array  with  an  optical  distribution  system. 

One  problem  is  the  equalization  of  the  microwave  path  lengths  for 
the  various  channels  of  the  system.  A  technique  to  measure  the 
microwave  path  length  is  described  in  this  report.  The  method 
uses  a  network  analyzer  to  compare  the  phases  of  an  RF  modulated 
light  signal  in  the  different  branches  of  the  fiber  optic 
distribution  network.  By  equalizing  the  phases  of  all  the 
branches,  the  path  lengths  may  be  equalized. 

Another  problem  is  that  of  providing  a  suitable  broadband  optical 
technique  for  adjusting  the  phase  of  each  channel  to  an  arbitrary 
value.  One  approach  is  to  vary  the  length  of  the  optical  fiber  by 
inserting  a  path  length  adjuster  in  the  fiber  path.  This  results 
in  true-time-delay  beamsteering.  The  path  length  adjuster 
described  below  consists  of  two  GRIN  lenses,  one  to  collimate  the 
beam  and  the  other  to  refocus  it  into  the  fiber.  It  is  shown  to 
have  low  loss  and  can  provide  time  delays  of  up  to  a  nanosecond. 

3.  Theoretical  and  Experimental  Design 

3.1.  True-Time-Delay  Beam  Steering  System 

A  phase  shift  (|3)  may  be  introduced  between  successive  antenna 
elements  of  a  phased  array  by  introducing  a  progressive  time  delay 
Tg  between  each  element.  Such  a  time  delay  would  result,  for 
example,  if  the  driving  signal  were  split  into  N  paths  as  in  Fig. 

3.1,  each  differing  in  length  by  an  amount  Lg  =  vTg ,  where  v  is 
the  signal  speed  in  the  delay  medium.  If  the  frequency  of  the 
driving  signal  is  f,  the  phase  shift  (3  is  then  given  as 

(3  =  -  2TTTg/T  =  -  <2rTTo)f  <3.1) 

where  T  =:  1/f.  The  negative  sign  indicates  that  the  phase  of 
element  n-fl  lags  that  of  element  n. 
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3.2.  Fiberoptic  feed  system  for  a  phased  array  antenna 

A  block  diagram  for  a  four  element  phased  array  is  presented  in 
Fig.  3.2. 

A  signal  source  modulates  a  laser  diode  <L0).  The  modulated 
optical  signal  is  carried  by  a  length  of  single-mode  fiber  (SMF) 
and  divided  into  four  paths  of  approximately  equal  length  using 
three  1x2  single-mode  couplers  <C1,  C2,  C3).  Each  of  the  four 
beams  is  then  collimated  in  free  space  using  a  graded  index  (GRIN) 
lens  (G1-G4).  The  collimated  beam  traverses  a  variable  distance  L 
and  is  refocused  by  a  second  GRIN  lens  (Gl'-G4')  into  a  short 
length  of  multimode  fiber  that  is  pigtailed  to  a  high  speed 
photodetector  (PD1-PD4).  The  subassembly  consisting  of  the  two 
GRIN  lenses  is  referred  to  as  a  path  length  adjuster  (PLA)  and  is 
described  below.  Finally,  the  signal  is  amplified  (A1-A4)  and 
delivered  to  the  antenna  elements  <£1-E4). 


3.3.  Path  Length  Adjusters 

The  optical  signal  originates  at  the  laser  diode  and  is  split  into 
four  fiber  optic  paths  for  delivery  to  the  photodiodes.  For  a 
broadside  beam  each  fiber  optic  path  must  be  of  equal  length. 
Moreover,  steering  the  beam  requires  a  progressive  increment  in 
path  length.  Both  of  these  requirements  are  satisfied  by  the  path 
length  adjuster  (PLA)  of  Fig.  3.2.  A  more  detailed  drawing  of  the 
PLA  can  be  found  in  Fig.  3.3. 

The  PLA  consists  of  a  collimated  output  beam  from  one  leg  of  the 
coupler,  a  variable  path  length  over  which  the  beam  travels,  and  a 
GRIN  lens  used  to  refocus  the  beam  into  the  pigtail  of  a 
photodiode.  The  output  of  the  coupler  is  collimated  by  a  GRIN 
lens  assembly  that  is  especially  designed  for  this  experiment  and 
is  detailed  in  the  next  section.  Both  the  output  and  input  lens 
assemblies  are  mounted  on  a  common  optical  axis  using  optical 
mounts  which  allow  tilt  and  z  (longitudinal)  adjustment.  In 
addition,  the  input  lens  is  adjustable  in  the  xy  (transverse) 
plane. 

Besides  providing  a  means  to  steer  the  beam,  the  PLA  also  permits 
initial  adjustment  of  the  array  for  zero  phase  difference.  Even 
though  corresponding  fibers  in  the  four  legs  of  the  system  are  cut 
to  nearly  equal  lengths,  some  uncertainty  in  length  exists  due  to 
the  variability  within  the  couplers,  the  photodiodes,  and  the 
amplif iers. 

It  is  possible  to  measure  the  length  difference  between  any  two 
legs  of  the  array  using  a  network  analyzer.  By  measuring 
length  difference  will  manifest  itself  as  a  phase  difference 
between  legs.  The  PLA  may  be  used  to  equalize  the  phases  and 
thereby  reduce  the  length  differences  to  zero.  Since  the  network 
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analyzer  can  resolve  phase  differences  of  less  than  0.5®,  at  a 
frequency  of  10  GHz  it  is  possible,  in  theory,  to  resolve  length 
differences  as  small  as  40  nm*  In  practice  it  is  found  that  a 
length  of  250  Mm  can  be  resolved  with  no  difficulty,  and  with 
averaging  this  value  may  be  lowered  toward  its  theoretical  limit. 


3.4.  GRIN  Lens  Collimators 

A  necessary  condition  for  projecting  the  beam  over  a  length  of 
free  space  and  refocusing  it  into  a  fiber  is  that  the  beam  be  well 
collimated.  Also,  for  efficient  coupling  it  is  necessary  that  the 
core  of  the  source  fiber  be  no  larger  than  that  of  the  receiving 
fiber.  The  GRIN  lens  collimator  of  Fig.  3.4  was  designed  to  meet 
these  conditions. 

The  collimator  consists  of  a  glass  sleeve,  the  elastomeric 
subsection  of  a  GTE  lab  splice,  and  a  0.23  pitch  GRIN  lens.  The 
fiber  is  centered  in  the  glass  sleeve  and  held  firmly  in  place  by 
the  elastomeric  splice.  The  fiber  passes  through  the  elastomeric 
splice  and  protrudes  by  about  0.5  mm.  The  GRIN  lens  is  then 
positioned  in  the  glass  sleeve  such  that  the  emerging  beam  is  well 
collimated.  A  more  detailed  assembly  procedure  may  be  found  in 
section  4.3.2. 

Two  collimators,  a  source  and  a  receiver,  are  required  for  each 
PLA.  Single-mode  fiber  is  used  for  the  source  collimator,  because 
when  used  with  a  GRIN  lens,  single-mode  fiber  provides  better 
coll imat ion  of  the  output  beam  than  does  multimode  fiber.  This 
follows  since  the  HE^^  mode  is  nearly  Gaussian  and  has  less  beam 
divergence  than  that  of  multimode  fiber.  The  mode  field  diameter 
for  850  nm  single-mode  fiber  is  less  than  10  Mm,  and  this  beam  can 
be  subsequently  focused  by  a  lens  system  into  the  multimode  fiber 
of  the  receiver  with  very  little  loss. 

The  receiving  collimator  consists  of  a  GRIN  lens  and  a  few  meters 
of  multimode  fiber.  Multimode  fiber  is  used,  because  it  provides 
higher  coupling  efficiency  when  used  with  the  GRIN  lens.  The 
signal  quality  is  unaffected  by  transmission  over  such  short 
lengths  of  multimode  fiber. 

All  GRIN  lenses  are  anti-reflection  coated  for  lower  loss  and  to 
minimize  back  reflections  into  the  laser  diode. 
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4.  Experimental  results 


4.1.  Initial  Path  Length  Adjustment 

A  procedure  for  equalizing  all  four  path  lengths  using  a  networ*; 
analyzer  was  described  in  section  3.3.  One  port  of  the  analyzer 
drives  the  laser  diode  while  the  other  port  monitors  the  amplifier 
output.  S^2  measured  as  the  frequency  is  swept  over  an 
arbitrary  range  consistent  with  the  bandwidth  of  the  laser  diode, 
photodiodes,  and  amplifiers. 

Figure  4.1  shows  the  change  in  phase  for  one  leg  (channel  4)  as 
the  frequency  is  swept  from  9  to  10  GHz.  The  markers  record  the 
phase  at  four  specific  frequencies  for  comparison  with  the 
remaining  three  channels. 

Figures  4.2  -  4.4  show  the  phase  change  for  the  remaining  three 
legs  (channels  3,  2,  and  1,  respectively)  with  the  phase  change 
for  channel  4  also  appearing  in  each  of  the  figures  as  an  overlay. 

The  markers  are  at  the  same  frequencies  as  in  Fig.  4.1.  The 

figures  were  obtained  after  adjustments  were  made  to  equalize  the 
lengths  of  channel  4,  3,  2,  and  1  by  bringing  the  phases  of  all 
four  channels  into  coincidence. 

The  accuracy  to  which  the  path  length  difference  between  channels 
may  be  nulled  depends  on  the  accuracy  to  which  the  phase  (13)  may 
be  determined.  According  to  equation  3.1  the  phase  difference  0 
may  be  written  in  terms  of  the  length  difference  as 

(3  =  -2TT(f/c)^L  (4.1) 

where  T^  =  ^L/c  has  been  substituted.  If  (3  is  expressed  in 
degrees,  this  equation  may  be  solved  for  ^  L  as 

=  -  ((3/360*  )•  (30/f)  (4.2) 

where  the  frequency  is  in  GHz  and  AL  is  in  cm. 

At  an  operating  frequency  of  10  GHz,  eq.  4.2  indicates  that  a  1* 
variation  in  phase  results  in  a  path  length  difference  of  0.008  cm 
or  80  Aim.  Figures  4.1  -  4.4  show  an  average  phase  variation  of 
approximately  4*.  Thus,  the  path  lengths  are  equal  to  within 
approximately  300  Aim. 


4.2.  True-Time-Delay  Phase  Shift 

After  eqalizing  the  four  path  lengths  as  in  section  4.1,  the  path 
length  adjusters  may  be  progressively  incremented  by  an  amount  A  L 
and  the  phase  shift  recorded  and  compared  with  theory.  The  value 
A  L  13.125  mm  is  used,  because  it  represents  60*  beamsteering 
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when  used  with  a  microstrip  antenna  that  has  been  designed  for 
another  phase  of  this  project. 

According  to  eq.  4.1,  if  the  path  length  of  each  leg  is 
incremented  progressively  by  13.125  mm,  then  the  progressive  phase 
delay  ((3)  is  given  (in  degrees)  as 

(3  =  -  15.75®-f  (4.3) 

where  f  is  expressed  in  GHz. 

Thus  (3  decreases  linearly  with  f  and  has  the  value  141.75®  at 
f  =  9  GHz. 

Figure  4.5  shows  the  measured  phase  shifts  of  channels  1-4  for 
frequencies  9.0-9.32  GHz.  It  is  clear  from  the  figure  that  the 
phase  differences  between  channels  remain  constant  for  all 
frequencies,  a  required  characteristic  for  a  broadband  system. 

Table  4.1  lists  the  measured  value  of  phase  for  each  channel  at  9 
GHz  and  compares  the  progressive  phase  difference  between  adjacent 
channels  with  the  theoretical  value  of  141.75®  as  found  above. 


Table  4.1 


Measured  phase  of  each  channel  and  phase  differences  between 
adjacent  channels  at  f  =  9  GHz  for  =  13.125  mm.  Theoretical 

phase  difference  between  adjacent  channels  is  141.75®. 


channel 

phase 

meas 

4 

-119® 

— _ 

3 

98® 

143® 

2 

-45® 

143® 

1 

176® 

139® 

phase  difference 


Refer  to  Fig.  4.5  when  computing  the  phase  difference  between 
channels.  Note  that  phase  decreases  as  we  progress  from  channel  4 
to  channel  1.  Starting  at  -119®  at  channel  4  proceed  by  51®  to 
-180®  and  then  by  82®  from  4-180®  to  4-98®  at  channel  3.  The  phase 
shift  from  channel  4  to  channel  3  is  thus  51®  4-  82®  =  143®.  The 
remaining  channels  proceed  in  a  similar  manner. 

According  to  Fig.  4.5  the  phase  decreases  linearly  with  frequency 
(negative  slope)  as  required  by  equation  4.1.  The  shorter 
positive  slope  segments  of  the  curves  are  due  to  the  discrete 
rather  than  continuous  frequency  sweep.  Were  the  sweep  to  be 
continuous,  then  each  negative  slope  segment  would  fall  to  -160® 
and  rise  sharply  to  4180®  as  frequency  increased. 
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4.3.  Grin  Lens  Collimator 


4.3.1.  Loss  measurements 

With  proper  alignment  of  the  PLA  it  is  possible  to  make  length 
adjustments  of  over  IS  cm  with  only  a  5X  change  in  optical 
intensity  to  the  photodiode.  This  corresponds  to  a  lOX  change  in 
the  electrical  power  delivered  to  the  antenna.  Path  length 
adjustments  of  8  cm  or  less  result  in  a  change  in  optical 
intensity  of  less  than  2X.  Figure  4.6  shows  a  plot  of  the  actual 
intensity  change  versus  the  change  in  path  length  for  each  of  the 
four  channels.  The  overall  insertion  loss  is  plotted  against 
separation  for  each  PLA  in  Fig.  4.7. 

The  greater  variability  in  channel  4  dictates  its  choice  as  the 
reference  channel,  i.e.,  the  channel  for  which  the  path  length 
remains  fixed. 

4.3.2.  Assembly  procedure 

The  collimator  consists  of  a  glass  sleeve  <1)  of  inside  diameter 
1.83  mm,  the  elastomeric  subsection  of  a  GTE  lab  splice  <2),  and  a 
0.23  pitch  GRIN  lens  of  outside  diameter  1.80  mm  (3).  The  fiber 
is  centered  in  the  glass  sleeve  by  the  elastomeric  splice.  The 
elastomeric  splice  has  been  cut  in  half.  The  fiber  passes  through 
the  elastomeric  splice  and  protrudes  by  about  0.5  mm.  It  is  held 
firmly  in  place  by  the  splice.  The  GRIN  lens  is  then  cemented  in 
place  once  it  is  positioned  in  the  glass  sleeve  such  that  the 
emerging  beam  is  well  collimated. 

The  actual  insertion  of  the  fiber  into  the  elastomeric  splice 
takes  place  with  the  splice  only  partially  inserted  into  the  glass 
sleeve.  This  reduces  the  force  necessary  to  insert  the  fiber  and 
prevents  breakage.  The  splice  is  then  pushed  into  the  sleeve  with 
a  pair  of  tweezers.  A  sketch  of  the  assembled  collimator  may  be 
found  in  Fig.  3.4. 

The  GRIN  lens  is  focused  as  follows.  A  piece  of  thin  double  back 
tape  is  wrapped  over  the  cutting  edge  of  an  Ezacto  blade,  which  is 
then  secured  to  an  xyz  stage.  The  cutting  edge  is  gently  placed 
in  contact  with  the  cylindrical  edge  of  the  GRIN  lens.  With  the 
tape  adhering  to  the  lens  the  xyz  stage  is  used  to  move  the  lens 
in  and  out  until  the  beam  is  collimated.  After  the  final 
adjustment  the  lens  should  protrude  from  the  glass  sleeve  by 
several  mm.  U-V  curing  epoxy  is  applied  to  the  lens  where  it 
protrudes  from  the  glass  sleeve  followed  by  u-v  light  to  cure  the 
epoxy.  The  epoxy  may  be  applied  one  drop  at  a  time  using  a  small 
piece  of  optical  fiber  as  an  applicator. 
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5.  Conclusions 


The  apparatus  described  in  the  above  report  was  designed  to 
investigate  the  performance  of  a  fiber  optic  distribution  system 
used  for  beam  steering  with  a  phased  array  antenna  at  x-band 
microwave  radiation.  Discrete  components  were  used  to  breadboard 
the  system,  parts  of  which  might  later  be  put  into  integrated 
format. 

A  novel  design  for  the  fabrication  of  a  fiber  optic  collimator  and 
path  length  adjuster  was  described  and  test  results  cited. 

Although  the  apparatus  described  in  this  report  was  designed 
primarily  for  the  study  of  beam  steering,  it  has  proved  to  be  both 
versatile  and  reliable  and  could  find  applications  in  other 
studies.  For  example,  it  could  be  used  to  provide  multiple 
electrical  (or  optical)  signals  at  gigahertz  frequencies  with 
infinitely  variable  time  delay  from  less  than  one  picosecond  to 
several  nanoseconds.  Time  delays  could  be  programmed  quickly  and 
easily  by  mounting  the  appropriate  element  of  the  path  length 
adjuster  on  a  computer  driven  translation  stage.  This  would  also 
permit  automated  data  acquisition.  Moreover,  the  amplitude  of 
each  channel  could  be  adjusted  by  inserting  neutral  density 
filters  in  the  beam  within  the  PLAs.  This,  too,  could  be 
automated  using  a  variable  neutral  density  filter  disk. 
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Figure  3. 


Progressive  time  delay  between  antenne  elements 
caused  by  the  incremental  path  difference  results 
in  a  progressive  phase  shift  <(3).  The  beam  is 
directed  at  angle  6. 
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COAXIAL  CABLE 


Figure  3.2.  Block  diagram  of  a  four  element  phased  array  antenna. 


ALUMI 


Figure  3.4.  GRIN  lens  collimator. 
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Figure  4.2.  Phase  vs.  frequency  for  channel  3.  Channel  3  is  indicated  by  the 
markers.  Channel  4  is  shown  as  an  overlay. 


Figure  4.3.  Phase  vs.  frequency  for  channel  2.  Channel  2  is  indicated  by  the 
Barkers.  Channel  4  is  shown  as  an  overlay. 
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Figure  4.4.  Phase  vs.  frequency  for  channel  1.  Channel  1  is  indicated  by  the 
Barkers.  Channel  4  is  shown  as  an  overlay. 
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START  9.0000  GHz  STOP  9.3200  GHz 

Figura  4.S.  Phasa  shifts  vs.  fraquancy  for  channals  1-4  with  tha  PLAs  adjustad 
for  60*  baas  staaring. 
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ABSTRACT 


This  report  presents  the  theoretical  development  and  the 
simulation  of  an  adaptive  nonlinear  communications  receiver 
based  on  an  M-Interval  Polynomial  Approximation  (MIPA)  of  the 
Probability  Density  Function  (PDF)  for  the  received  signal 
envelope.  This  system  has  the  potential  to  perform  robustly 
for  a  wide  range  of  interference  scenarios.  Preliminary 
results  for  transmission  in  the  presence  of  Continuous  Wave 
(CW)  and  Wide  Band  (WB)  interference  are  presented  and 
discussed.  A  comparison  of  this  method  to  another  nonlinear 
processor  is  performed.  The  preliminary  results  indicate  that 
the  performance  of  the  various  simulations  is  similar,  and 
that  all  of  the  simulations  perform  substantially  better  than 
linear  receivers.  Recommendations  for  continued  topics  of 
research  are  also  discussed. 
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1 


INTRODUCTIOM 


Many  facets  are  involved  in  the  design  of  a 
communications  system.  One  of  considerable  importance  is 
determining  a  method  to  recover  the  transmitted  signal  when  it 
is  subjected  to  interference  in  the  transmission  path.  The 
first  step  in  the  process  is  choosing  a  suitable  interference 
model.  Using  this  model,  it  is  possible  to  design  a  receiver 
that  will  either  remove  the  effects  of  the  interference  on  the 
transmitted  signal,  or  operate  on  the  transmitted  signal  plus 
interference  in  such  a  way  as  to  make  determination  of  the 
correct  transmitted  waveform  possible.  This  subject  is  the 
focus  of  the  current  study. 

For  many  practical  applications,  it  is  possible  to  model 
interference  using  a  Gaussian  Probability  Density  Function 
(PDF) ,  given  by: 

fff(n)= h — xe  (1”1) 

where:  ff(t)  is  the  noise  random  process, 

fi  is  the  mean  of  the  noise, 
is  the  variance. 

The  optimum  processor  for  this  type  of  interference  is  a 
linear  processor  which  incorporates  the  use  of  a  matched 
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filter  for  signal  recovery.  The  matched  filter  impulse 
response  is: 

h(t)  =kx(T-t)  (1-2) 

where:  h(t)  is  the  matched  filter  impulse  response, 

x{t)  is  the  transmitted  signal  waveform  (real) , 
T  is  the  period  of  the  transmitted  waveform, 
k  is  an  arbitrary  constant. 

Alternately,  a  time  correlator  realization  of  the  matched 
filter  can  be  used  for  signal  recovery.  The  time  correlator 
is  derived  from  examining  the  output  of  the  matched  filter  at 
time  t=T,  which  can  be  written  as: 

T 

y(T)  =jr(t)  xx(t)  dt  (1-3) 

0 

where:  y(T)  is  the  output  of  the  matched  filter  at  time 

T, 

r{t)  is  the  received  signal, 

x{t)  is  the  transmitted  signal  waveform. 

Either  of  these  processing  techniques  can  be  used  by  a 
receiver  to  recover  a  transmitted  signal  in  the  presence  of 
Gaussian  noise.  When  operating  in  a  two-dimensional 
environment,  these  operations  must  be  performed  on  both  the 
In-phase  (I)  and  Quadrature  (Q)  channels. 
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However,  in  many  applications  the  interference  is  of  a 
non-Gaussian  nature.  Examples  iiiclude  Continuous  Wave  (CW) 
i*.terference,  impulsive  interference,  and  non-Gaussian  Wide 
Band  (WB)  interference.  In  these  cases  it  is  necessary  to 
employ  nonlinear  processing  techniques  to  recover  the 
transmitted  waveform  at  the  receiver.  Examples  of  such 
processing  have  been  described  in  a  document  by  Hazeltine 
Corporation^ and  by  J.  L.  Higbie^^K  Their  methods  are 
compared  in  Appendix  A  for  the  case  of  a  two-dimensional 
signal  environment. 

The  method  investigated  in  this  study  utilizes  the 
likelihood  function  of  statistical  detection  theory^^^.  This 
technique  requires  determination  of  the  received  signal  PDF. 
One  procedure  which  evolved  from  this  approach  utilizes  the 
Locally  Optimal  (LO)  maximum  likelihood  function^^^.  For  a 
two-dimensional  receiver,  the  LO  likelihood  function  is 
derived  in  Appendix  B  and  has  the  form; 

Choose  fj  and  j=l-N,  to  maximize 
_  K 

Lj=Yi  cosex g (r) +Sj^jXsin0xg(r) }  (1-4) 
kxi 


with. 


g(r) 


dr  \  1 
fair)  r 


(1-5) 
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where :  Lj 


fei^) 


^kjt  ^kj 

N 


K 


r 

0 

g(r) 


is  the  LO  maximum  likelihood  function  for 

the  transmitted  signal  pair, 

is  the  received  envelope  PDF, 

are  the  transmitted  signal  vectors  for 

the  I  and  Q  channel  respectively, 

is  the  of  K  samples  used  to  represent 

the  transmitted  signal  pair, 

is  the  number  of  total  possible 

transmitted  signal  pairs  in  the  signal 

space, 

is  the  length  of  a  signal  vector,  i.e. 
the  number  of  samples  per  symbol, 
is  the  received  magnitude, 
is  the  received  phase  angle, 
is  the  optimum  nonlinearity. 


In  practice,  however,  it  is  difficult  to  construct  a 
continuous  PDF  or  to  take  a  continuous  derivative  of  a 
discrete  function.  Therefore,  it  is  necessary  to  use  a  method 
of  approximation  to  develop  the  optimum  nonlinearity,  g(r)  . 
Two  methods  described  by  Hazeltine  Corporation^ are  examined 
in  this  study:  the  histogram  approximation  “.nd  the  M-Interval 
Polynomial  Approximation  (MIPA) .  In  statistics,  the  histogram 
method  is  the  classical  approach  for  approximating  a  PDF.  The 
MIPA  algorithm  is  an  extension  of  the  histogram  approach  and 
utilizes  a  set  of  polynomial  functions  to  approximate  the  PDF. 
In  the  case  at  hand,  this  set  of  functions  as  a  whole 
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describes  completely  the  entire  PDF  of  the  received  signal 
envelope.  An  added  benefit  to  t^'■s  approach  is  that  the 
derivative  of  a  polynomial  is  easily  calculated.  Therefore, 
this  algorithm  allows  for  approximation  of  the  optimum 
nonlinearity  in  real  time.  Notice  that  a  zero-order  MIPA 
reduces  to  a  histogram.  Thus,  these  two  methods  are 
interrelated. 

In  addition  to  examining  the  MIPA  implementation  of  the 
LO  maximum  likelihood  function,  a  no*^  ii’'.'5ar  processing 
technique  developed  by  J.  L.  Higbie^^^  was  examined,  and  a 
preliminary  comparison  to  the  MIPA  algoritl  i  >/a3  made.  The 
Higbie  approach  was  implemented  by  Rome  Laboratories  (RL) 
(formerly  Rome  Air  Development  Center  (RADC) )  in  the  current 
Adaptive  Nonlinear  Coherent  Processor  (ANCP)  simulation.  A 
theoretical  comparison  between  the  LO  and  Higbie  optimum 
nonlinearities  is  presented  in  Appendix  A. 

The  development  of  the  MIPA  algorithm  and  the 
implementation  of  the  LO  maximum  likelihood  function  have 
comprised  the  major  efforts  of  the  current  study.  The 
following  sections  present  the  results  and  conclusions  of  the 
simulation  of  these  techniques,  as  well  as  comparisons  to  the 
Higbie  approach.  Also,  because  of  discrepancies  in  the 
Hazeltine  report,  the  derivation  of  the  LO  maximum  likelihood 
function  for  two-dimensional  signals  is  presented  in 
Appendix  B. 
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2 .  OBJECTIVES 


The  ultimate  goal  of  this  research  is  to  determine  an 
effective  means  for  mitigating  the  effects  of  non-Gaussian 
interference  in  a  communications  system.  To  achieve  this  task 
many  intermediate  goals  must  be  accomplished.  As  stated  in 
the  proposal  for  the  current  project,  these  include: 

1)  Examine  the  M-Interval  Polynomial  Approximation  (MIPA) 
approach  to  modeling  a  Probability  Density  Function 
(PDF)  .  Extend  this  theory  to  a  two  dimensional,  i.e.  in- 
phase  and  quadrature  channel,  receiver. 

2)  Integrate  the  MIPA  implementation  of  the  Locally  Optimal 
(LO)  maximum  likelihood  function  into  the  existing 
Adaptive  Nonlinear  Coherent  Processor  (ANCP)  simulation. 

3)  Compare  the  performance  of  the  MIPA  to  the  histogram 
approximation  of  the  PDF. 

4)  Determine  the  optimal  order  of  polynomial  and  number  of 
intervals  for  the  MIPA. 

5)  Incorporate  Bit  Error  Rate  (BER)  as  a  performance  metric. 
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Test  the  system  under  various  forms  of  modulation  such  as 
Offset  Quaternary  Phase  Shift  Keying  (OQPSK)  and  Minimum 
Shift  Keying  (MSK) . 

Optimize  the  simulation  for  efficient  and  fast 
performance. 

The  current  status  of  each  intermediate  task  is: 

Development  of  the  MIPA  algorithm  for  approximating  the 
received  envelope  PDF  was  accomplished. 

Integration  of  the  MIPA  implementation  of  the  LO  receiver 
into  the  ANCP  simulation  was  accomplished. 

Preliminary  comparison  between  the  MIPA  and  the  histogram 
approximation  of  the  received  envelope  PDF  was  performed. 

Preliminary  comparisons  concerning  the  optimum  order  and 
number  of  intervals  for  the  MIPA  algorithm  was  performed. 
More  investigation  is  necessary  before  conclusions  can  be 
formulated. 

Incorporation  of  BER  analysis  will  require  extensive 
modification  of  the  current  simulation.  This  task  will 
be  addressed  in  future  research. 
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6) 


Through  consultation  with  Rome  Laboratories  (RL)  it  was 
determined  that  studying  MSK  in  detail  would  be  more 
beneficial  than  examining  many  different  modulation 
schemes . 

7)  Preliminary  methods  for  optimization  were  used  in  the 
MIPA  portion  of  the  current  simulation.  More  efforts  are 
reguired  to  optimize  the  entire  ANCP  simulation. 
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3.  THEORETICAL  DEVELOPMENT  OF  THE  N-INTERVAL  POLYNOMIAL 
APPROXIMATION  (MIPA)  NONLINEAR  PROCESSOR 

3.1  DEVELOPMENT  OF  THE  LOCALLY  OPTIMAL  MAXIMUM  LIKELIHOOD 
FUNCTION  FOR  TWO-DIMENSIONAL  SIGNALS 

In  modern  communications  systems,  it  is  necessary  to 
develop  methods  to  improve  signal  reception  when  interference 
signals  are  present  in  the  transmission  path.  The  use  of 
nonlinear  signal  processing  based  on  the  likelihood  function 
is  one  such  method.  In  particular,  the  focus  of  the  current 
study  is  on  developing  the  likelihood  function  for  complex,  or 
two-dimensional,  signals  and  then  using  an  M-Interval 
Polynomial  Approximation  (MIPA)  Algorithm  for  implementation. 
A  full  development  of  the  likelihood  function  is  presented  in 
Appendix  B,  but  the  key  steps  are  shown  here. 

The  transmitted,  or  source,  signal  is  represented  by  a 
random  process  with  an  in-phase  component  R(t)  and  a 
quadrature  component  S(t).  For  the  received  signal,  the  in- 
phase  component  is  I(t)  and  the  quadrature  component  is  Q(t)  . 
Finally,  the  interference,  or  noise,  is  characterized  by  an 
in-phase  signal  X(t) ,  and  a  quadrature  signal  Y(t).  The 
received  signal  is  sampled  once  per  symbol  period.  The 
resulting  relationship  is: 

J=J?+X  Q=S+Y  (3-1) 


8-12 


The  received  signal  joint  PDF  is  given  by: 


•f(S4>aD 

=  |  fxY(i-r,q-s)  drds 

-CD -2^00 


(3-2) 


where:  fng(r,s)  is  the  joint  PDF  of  the  transmitted 

signal, 

fxY(x,y)  is  the  joint  PDF  of  the  interference. 

If  the  transmitted  signal  is  one  of  N  equiprobable  transmitted 
signal  pairs,  then  Eq.  (3-2)  reduces  to: 


(3-3) 


Where:  and  Sj  are  one  of  N  possible  transmitted 

signal  pairs. 

For  the  li)celihood  function,  it  is  desired  to  maximize 
P(i?,5|J,0)  ,  the  probability  that  R=rj  and  S=Sj  given  thatl=i 
and  Q=q ,  where  i  and  q  are  )cnown.  This  is  equivalent  to 
choosing  and  Sj  to  maximize  fxY(.i-^jr<J-Sj)  . 

If  each  signal  pair  is  sampled  to  form  vectors  of  length 
K,  i.e.  I=R*X  and  Q-~S*Y ,  Eq.  (3-3)  must  be  modified.  When 
independent  and  identically  distributed  samples  are  assumed, 
the  corresponding  likelihood  function  becomes: 
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Choose  i*j  and  Sj,  j=l->N,  to  maximize: 

X 

V=n  ^XYUk-rjk><ik-Sjk)  <3-4) 

Where:  K  is  the  length  of  a  signal  vector, 

qj^  is  the  sample  of  the  received  signal 
for  the  I  and  Q  channels  respectively. 

To  simplify  calculations,  the  natural  logarithm  of  Eq. 
(3-4)  is  used.  This  results  in  the  Globally  Optimal  (GO) 
maximum  likelihood  function: 

Choose  and  Sj ,  j=l-*N,  to  maximize 

X 

]  <3“5) 

x=i 

For  a  large  Jammer  to  Signal  (J/S)^  ratio,  a  first-order 
Taylor  Series  expansion  around  the  received  signal  point  can 
be  used  to  approximate  the  interference  PDF.  The  expansion  is 
valid  because  the  deviation  of  the  interference  from  the 
received  signal  point  is  minimal  for  a  large  J/s.  This 
simplification  results  in  the  Locally  Optimal  (LO)  maximum 
likelihood  function: 

typical  jammer  la  usually  at  least  20dB  greater  than  the  transmitted 
signal.  Therefore,  for  a  signal  environment  where  jammers  are  present,  a  large 
J/S  assumption  is  valid. 
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Choose  fj  and  Sj,  j  =  l-*N,  to  maximize 


^  -^^IQi^kf^k)  I  (3-6) 

fiQi^k>(Jk)  fiQUk,qk) 


where:  figi')  is  the  received  signal  PDF. 


However,  because  of  the  limited  ability  of  the  simulator 
to  approximate  continuous  functions,  determination  of  the 
received  signal  joint  PDF  can  be  a  difficult  process,  and 
determination  of  the  partial  derivatives  even  more  so.  To 
reduce  the  complexity  of  the  likelihood  function,  complex 
bivariate  radial  symmetry  of  the  received  signal  PDF  is 
assumed.  Therefore,  the  received  signal  PDF  is  given  by: 


t<l)  - 


2Tir  ' 


O<0<27r 

elsewhere 


(3-7) 


where:  fe(r)  is  the  received  envelope  PDF, 

r=vi^+g^  is  the  received  magnitude, 

0=arctan5  is  the  received  phase  angle. 

This  assumption  is  valid  because  interference  sources  of 
interest  will  not  have  predictable  phase  angles.  Even  a 
constant-frequency  waveform  will  have  a  vector  that  rotates  at 
a  constant  rate  and  is  therefore  equally  likely  at  any 
angle. With  this  assumption  the  likelihood  function 
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reduces  to  Eq.  (1-4),  repeated  below: 


Choose  fj  and  Sj ,  j  =  to  maximize 


K 

h-I. 


k*l 


xcos6x 


fe(r)  ^ 


,xsin0x 


f^ir)  r 


(3-8) 


This  is  the  likelihood  function  that  is  implemented  in 
the  ANCP  simulation.  Of  importance  is  the  optimum 
nonlinearity  of  Eq.  (1-5),  also  repeated  below  for 
convenience: 


1 

q(r)  =--zf_+J: 
^  fe(r)  r 


(3-9) 


The  MIPA  algorithm  is  one  method  by  which  the  optimum 
nonlinearity  can  be  computed. 
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3.2  DEVELOPMEKT  OF  THE  MIPA  ALGORITHM^'^ 

The  nonlinear  function  expressed  in  Eq.  (3-9)  requires 
knowledge  of  the  probability  density  function  (PDF)  of  the 
received  signal.  Since  it  is  difficult  to  determine  the  exact 
value  of  the  PDF,  an  approximation  must  be  made.  The  two 
approximations  investigated  are  the  histogram  approximation 
and  the  M-Interval  Polynomial  Approximation  (MIPA) . 

The  acronym  MIPA  requires  further  explanation.  The  term 
M-Interval  is  used  because  the  received  signal  PDF  is  divided 
into  M  intervals.  The  Polynomial  Approximation  term  is  used 
because  the  PDF  is  subsequently  approximated  by  a  separate 
polynomial  over  each  interval.  Thus,  a  MIPA  is  simply  a 
concatenation  of  polynomial  curves  used  to  approximate  the 
actual  PDF  of  the  received  signal.  MIPAs  of  order  0,  2,  and 
4  are  investigated  in  this  report,  where  the  order  refers  to 
the  order  of  the  polynomials  used.  It  turns  out  that  the  zero 
order  MIPA  reduces  to  a  histogram  since  zero  order  polynomials 
are  constants  which  produce  horizontal  lines.  The  boundaries, 
or  breakpoints  of  each  interval  are  determined  by: 

. . .  ,M  (3-10) 

where:  h^  is  the  value  of  the  m***  breakpoint, 
v^j,  is  the  maximum  value  of  the  PDF, 
v^j^  is  the  minimum  value  of  the  PDF, 

M  is  the  number  of  intervals. 
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The  symbols  a  and  P  denote  the  lower  and  upper  breakpoints, 
respectively,  of  the  interval  of  interest. 

Since  a  separat‘d  polynomial  is  used  to  approximate  each 
interval,  the  PDF  of  the  received  signal  conditioned  on  the 
interval  of  interest  is  required.  This  is  written  as: 

f„(r)  -f,(rla<r<&)  -  .  _  (3-11) 

where:  f^{r)  is  the  received  envelope  PDF, 

P(a<r<R)  is  the  probability  that  r  is  in  the 
interval  [a,fi] . 

The  general  form  of  the  approximating  polynomial  is: 

Where:  /ec(^)  is  the  MIPA  of  , 

P  is  the  order  of  the  polynomial, 
b,^  are  the  polynomial  coefficients. 

Values  for  b^  must  be  determined  such  that  the  polynomial  best 
approximates  the  PDF  according  to  some  error  criteria.  A 
common  and  tractable  method  is  the  least  squares  error 
minimization^^) ,  Applying  this  yields: 


8-18 


(3-13) 


B 

c-J(^ec(^)  -f^(r)fdr 

a 

where:  e  is  the  error  to  be  minimized. 

The  general  form  of  the  approximating  polynomial  from  Eg.  (3- 
12)  can  be  substituted  for  f^ir)  into  Eq.  (3-13). 


e 


(3-14) 


where:  e (B)  is  the  error  as  a  function  of  B, 

B  is  the  polynomial  coefficient  vector  [bg  . 


The  error  can  be  minimized  for  each  coefficient  by  taJcing  the 
derivative  of  e(jb,)  with  respect  to  each  bj,  setting  it  equal 
to  zero,  and  solving  for  b,.. 


de  (Jb,.) 
dJb, 


B 


-1 


-2r' 


k-0 


i-0,1, 


,P  (3-15) 
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« 


simplifying  Eq.  (3-15)  yields 


~  r 

ffec  r'dr-  fr‘53 

J  J  t--n 


(3-16) 


The  term  on  the  left  side  of  Eq.  (3-16)  is  just  the  i***  moment 
of  i*e-  fflxj*  Solving  the  integral  on  the  right  side 
yields: 


j.+;c+l  '  /  /  / 


(3-17) 


Equation  (3-17)  can  be  written  in  matrix  form  by  letting  i  be 
the  row  index  and  k  be  the  column  index.  Thus,  MIPA 
coefficients  of  general  order  are  given  by: 


B-a 

B^-a^ 

^X1 

1 

fl2-a2 

2 

B^-a^ 

2 

gP*i_^P^i 

3 

®XP 

P+1 

"TTi 


2P+1 


(3-18) 


For  notational  purposes,  Eq.  (3-18)  can  be  written  as: 


(3-19) 
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(3-20) 


5-/f;Vn^ 


where:  is  the  (P+l)xl  vector  of  moments, 

is  the  (P+1) X (P+1)  matrix  of  intervals. 

The  vector  5  can  be  computed  directly  from  Eq.  (3-20)  since 
m^,  can  be  computed  from  the  received  signal,  a  and  P  can  be 
computed  from  M,  and  P  is  a  parameter  of  the  simulation. 
However,  this  method  requires  the  inversion  of  H^,  wnich  is  a 
time  consuming  task.  The  performance  of  the  simulator  can  be 
improved  significantly  by  avoiding  this  computation. 

The  matrix  relationships  shown  i.i  Eq.  (3-19)  and  Eq.  (3- 
20)  can  be  normalized  by  mapping  the  interval  [a,fl]  onto  the 
interval  [0,1]  using  the  linear  mapping: 


y-5-f! 

b  -a 


Under  this  mapping. 


H,- 


1 

1 

1 

2 


1 

P+1 


1 

2 

1 

3 


1 

P+1 


1 

2P+1 


n-21) 


(3-22) 
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1 


Note  from  Eq.  (3-22)  that  Hy  is  a  constant,  and  therefore  so 

I 

is  Hy'\  Thus,  the  inverse  of  Hy  need  only  be  computed  once  • 

when  the  receiver  is  initially  activated,  without  the  linear 
mapping,  would  need  to  be  computed  for  every  interval  of 
the  PDF.  The  moment  vector  Mf  of  the  random  variable  Y  may  be  ' 

determined  using  the  following  equations'^^ : 

raYi-Jy‘^tc(y)dy  (3-23) 

1 

or, 

(3-24) 

where:  f^^Cr)  is  f^(jHa<r<6)  ,  the  conditional  signal  PDF 
with  r  as  index, 

fyg(y)  is  fy{'^<y<&)  ,  the  conditional  signal  PDF 
with  y  as  index, 

myj  is  the  i^**  moment  of  fyj.(y)  . 

Equation  (3-24)  can  be  written  in  matrix  form  by  letting  i  be 
the  row  index: 
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1 


7n,i-a 

(6-a) 

(B-at)2 

”»x3-3om>x2^3aX,-a3 

(S-a)^ 


(3-25) 


Equations  (3-22)  and  (3-25)  can  be  used  to  find 


(3-26) 


The  normalized  coefficient  vector  <?  is  an  intermediate  vector 
used  for  computational  efficiency.  A  transition  matrix  Q  is 
necessary  to  convert  3  to  B,  i.e. 


B~Qx3 


(3-27) 


The  required  Q  is: 


0- 


-a 


(-1)  V 


(B-a)  (6-a)2  (ft-c)^ 

1  -2a 

(fl-a)2  (6-a)^ 

«  1 


(B-a) 


0 

0 


(B-o)3 


(6_a)<P*i) 


(3-28) 


8-23 


Although  Q  must  be  calculated  for  every  interval,  it  need  not 
be  inverted,  thus  substantial  computational  savings  result. 

The  final  values  necessary  for  use  in  the  simulation  are 
scaled  values  of  B.  The  following  scaling  removes  the 
conditional  nature  of  the  PDF: 

a-P(a<r<6)5  (3-29) 

where:  a-[ao  a^  •  ap]  are  the  coefficients  for  f^^r)  . 
Thus,  the  simulation  performs  the  following  computation  for 
each  interval: 

(3-30) 

k-O 

Where:  a-(5x/fy’xiHyXP(a<r<6)  . 

The  calculation  of  the  optimum  nonlinearity  and 
implementation  of  the  LO  maximum  likelihood  function  requires 
the  derivative  of  f^ir)  .  This  derivative  is  formed  by  taking 
the  derivative  of  the  approximating  polynomial  of  fe(r)  for 
each  interval.  Since  each  interval  of  the  received  envelope 
PDF  is  represented  by  functions  of  the  form: 

fe(r) -aQ+a,r+-+apr'’  (3-31) 
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the  derivative  is  given  by  functions  of  the  form: 


(3-32) 


where:  g,-ixa,-  , 

This  indexing  results  in  efficient  looping  algorithms  in  the 
simulation. 

Therefore,  the  optimum  nonlinearity  given  by  Eq.  (3-9)  is 
approximated  using  the  expression: 


^(r)- 


g,^•g2r^-•^^^gpr'*~^ 


r 


(3-33) 
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4. 


SIKULATION  OF  MIPA  NONLINEAR  PROCESSOR 


This  section  discusses  the  implementation  of  the  MIPA 
nonlinear  processor  in  the  current  Adaptive  Nonlinear  Coherent 
Processor  (ANCP)  simulation.  The  purpose  of  this  discussion 
is  to  illustrate  the  key  considerations  and  data  flow 
necessary  for  this  algorithm.  A  pictorial  view  of  the  MIPA 
nonlinear  processor  is  shown  in  Fig.  (4-1) . 


MIPA  NONLINEAR  PROCESSOR  BLOCK  DIAGRAM 


Initialization 


Block  1 


Block  2 


Block  3 


4 


4 


I 


Figure  (4-1) 
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The  current  simulation  supports  zeroth  (histogram) , 
second  and  fourth  order  MIPA  approaches.  The  user  inputs  are: 
1)  enable  MIPA  processing,  2)  provide  the  order  of  MIPA 
approximation,  and  3)  provide  the  number  of  intervals  used  for 
approximation.  During  simulator  initialization  it  is 
necessary  to  create  the  constructs  that  are  stored  in  the 
nonlinear  processor  and  used  in  each  successive  decision 
cycle.  These  include:  the  inverse  coefficient  calculation 
matrices,  Hy~^ ,  used  for  computation  of  the  normalized  MIPA 
interval  coefficients;  the  coefficients  of  elements  in  the 
transition  matrix,  Q,  used  for  conversion  between  normalized 
and  actual  MIPA  interval  coefficients;  and  the  eight  MSK 
reference  transmitted  waveforms^^J . 

Of  interest  is  the  transition  matrix,  Q,  given  by  Eg.  (3- 

28)  .  As  can  be  seen,  each  column  contains  constants 

multiplied  by  - ,  where  j  is  the  column  index  and  i  is 

(6-a) 

the  row  index.  With  this  symmetry,  it  is  possible  to  construct 

a  matrix  consisting  of  the  constant  coefficients  of  thec? 

matrix.  During  simulation  execution,  an  efficient  algorithm 

is  used  which  multiplies  each  successive  column  of  this  matrix 

by  — —  and  ,  accordingly,  to  compute  the  actual 

(fl-a) 

transition  matrix.  This  eliminates  the  need  for 

exponentiation,  a  computationally  expensive  operation. 
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Once  this  information  is  stored  in  memory,  it  is  then 
possible  to  begin  execution  of  the  nonlinear  receiver  by 
invoking  the  first  block  of  the  MIPA  processor.  Since  the 
optimum  nonlinearity  utilizes  the  envelope  PDF  of  the  received 
signal,  fe(r)  ,  the  first  step  is  to  calculate  the  magnitude 
and  phase  of  the  received  signal.  With  this  information  it  is 
then  possible  to  create  a  number  of  equal  length  partitions 
for  local  approximation  of  fg(r)  .  The  number  of  partitions 
is  a  user  defined  input.  Next,  the  probability  of  each 
interval  is  determined  and  the  moments  for  each  interval  are 
calculated.  The  moments  are  approximated  using  the  equation: 


where:  is  the  i^^  moment,  i  =  l,",P,  of  the 

interval,  m=l,-,M, 

is  the  number  of  received  samples  that  lie 
between  a  and  6  of  the  m^^  interval. 

If  a  second  order  MIPA  is  chosen,  the  first  and  second 
moments  are  calculated.  For  a  fourth  order  MIPA,  the  first 
through  fourth  moments  are  calculated.  Next,  the  normalized 
interval  moments  are  calculated  using  Eg.  (3-25) .  The 
normalized  interval  moments  are  used  to  determine  the  MIPA 
interval  coefficients,  represented  by  the  vector  a.  These 
coefficients  are  the  polynomial  coefficients  used  to  locally 
approximate  fe(r)  . 
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The  second  block  of  the  MIPA  processor  calculates  the 
received  envelope  PDF  approximation,  fe(r)  .  This  is 
accomplished  in  three  steps.  First,  the  normalized  MIPA 
interval  coefficients  are  calculated.  This  is  done  using  Eq. 
(3-26)  .  Next,  Eq.  (3-27)  is  used  to  calculate  the 
approximating  coefficients  for  the  received  envelope  PDF 
conditioned  on  the  current  interval,  /^(rl  (a<r<B))  .  The  third 
step  is  the  calculation  of  the  actual  MIPA  coefficients  used 
for  approximation  of  fg(r)  .  This  is  accomplished  by  scaling 
each  of  the  coefficient  vectors  of  /g(r |  (a<r<fi) )  by  the 
probability  of  the  corresponding  interval, as  in  Eq.  (3-29) . 


The  calculation  of  the  optimum  nonlinearity  and 
implementation  of  the  LO  maximum  likelihood  function  are 
accomplished  in  the  third  block  of  the  MIPA  processor.  The 
first  stage  of  this  block  is  the  calculation  of  the  derivative 
of  fg(r)  using  Eq.  (3-32)  .  The  second  stage  of  this  block 
performs  simultaneous  calculation  of  the  optimum  nonlinearity 
for  each  received  magnitude,  and  calculation  of  the  likelihood 
function  for  each  possible  transmitted  waveform.  First,  the 
optimum  nonlinearity  is  calculated  for  the  current  received 
magnitude  sample  using  Eq.  (3-33) .  Then,  the  contribution  of 
this  sample  to  the  likelihood  function  of  each  possible 
transmitted  waveform  is  calculated  using: 

l^j^r^jXcoaQy  [gir)  ]  ^Sj^^xsinOx  [g(r)  ]  (4-2) 


where: 


K 

-Lr-J. 

k^l 
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This  calculation  is  performed  for  a  set  of  K  received 
magnitude  samples.  Once  completed,  the  largest  likelihood 
function  value  is  chosen,  and  the  corresponding  reference 
waveform  is  used  to  replace  the  received  I  and  Q  channel 
signals.  The  reference  waveforms  used  are  those  that  were 
stored  in  the  processor  during  initialization.  This  process 
continues  until  all  of  the  received  signal  data  is  exhausted. 


The  histogram  implementation  of  the  LO  maximum  likelihood 
function  is  currently  constructed  in  parallel  with  the 
order  MIPA  implementation.  They  are  quite  similar,  except  in 
the  formulation  of  the  optimum  nonlinearity.  For  the 
histogram  approach,  it  is  difficult  to  take  a  continuous 
derivative  because  of  its  discrete  nature.  By  noticing  that: 


(4-3) 


it  is  possible  to  estimate  the  histogram  nonlinearity  using 
the  relation: 


where: 

r„,  m=l,-,M,  is 

the  value 

of 

the 

interval 

nearest  in 

magnitude 

to 

the 

current 

received  signal  sample. 
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This  approximation  is  used  to  calculate  the  likelihood 
function  for  each  reference  waveform.  As  in  the  MIPA 
implementation,  the  transmitted  signal  waveform  with  the 
largest  likelihood  function  value  is  used  to  replace  the 
received  I  and  Q  signals. 

In  either  case,  the  resulting  received  signal  waveform  is 
passed  to  the  rest  of  the  ANCP  simulation  for  demodulation  and 
decoding.  The  next  section  illustrates  the  results  of  the 
MIPA  and  histogram  implementations. 
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5. 


RESULTS 


The  Adaptive  Nonlinear  Coherent  Processor  (ANCP) 
simulation  uses  the  MSK  encoded  message  shown  in  Fig.  (5-1) 
for  each  trial.  The  entire  frequency  spectrum  of  this  message 
(generated  from  a  1024  point  Fast  Fourier  Transform)  is  shown 
in  Fig.  (5-2)  .  This  spectrum  shows  both  magnitude  and  phase, 
and  was  verified  using  PC-MATLAB^**  To  compare  Fig.  (5-2) 
to  known  MSK  spectra ,  a  scatter  plot  in  deciBels  (dB)  of 
the  lower  frequency  components  for  this  spectrum  shifted  to 
baseband  is  shown  in  Fig.  (5-3)  .  This  spectrum  has  been 
normalized  by  dividing  all  frequency  components  by  the  highest 
magnitude. 


’PC-MATLAB  is  a  registered  trademark  of  The  MATH  WORKS  Inc. 
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Figure  5-3 


Various  types  of  interference  are  added  to  this  message 
by  the  simulator.  A  major  consideration  is  large 
interference,  usually  greater  than  20dB  above  the  signal 
power.  To  gain  a  better  perspective  of  how  the  interference 
affects  the  message  it  is  helpful  to  look  at  the  resultant 
signal  when  the  interference  and  the  message  have 
approximately  the  same  power.  Figures  (5-4)  and  (5-5)  show 
the  time  and  frequency  domain  representations,  respectively, 
for  a  signal  environment  with  this  type  of  CW  interference. 
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Figure  (5-6)  is  the  corresponding  scatter  plot.  Note  that  the 

*  CW  spike  appears  in  Fig.  (5-5),  but  not  in  Fig.  (5-6). 
This  is  because  of  the  reduced  bandwidth  displayed  in  the 
scatter  plot.  Since  the  CW  spike  determines  the  normalization 

•  factor,  the  entire  plot  is  shifted  down  lOdB  from  the 
comparable  plot  in  Fig.  (5-3). 

» 

Figures  (5-7)  and  (5-8)  show  the  time  and  frequency 
domain  representation  for  WB  interference  that  is  the  same 

► 

power  as  the  message,  and  Fig.  (5-9)  is  the  scatter  plot  of 
the  baseband.  The  upper  trace  on  the  scatter  plot  is  the 
'  signal  plus  interference,  and  the  lower  trace  is  the  reference 

message  with  no  interference  added. 

It  is  the  objective  of  this  research  to  determine  methods 
to  recover  the  message  from  the  corrupted  signal.  The 
preliminary  results  of  this  simulation  are  encouraging  and 
provide  an  impetus  for  continuing  research  in  this  area.  For 
a  performance  metric  the  present  simulation  uses  number  of 
bits  in  error  out  of  128  message  bits.  For  debugging  purposes 
and  preliminary  measurements  this  is  a  convenient  method  for 
making  general  comparisons.  A  better  performance  metric  would 
be  Bit  Error  Rate  (BER)  ,  based  on  a  much  larger  number  of 
bits.  However,  a  BER  test  would  require  significant 
modifications  to  the  ANCP  simulation  and  would  take  a 
substantial  amount  of  time  to  compute.  This  issue  is  best 
addressed  as  an  aspect  of  future  research. 
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The  following  three  examples  are  indicative  of  the 
capabilities  of  the  ANCP  simulation.  The  simulation  was 
performed  with  MIPAs  of  order  0,  2,  and  4  for  different 

interference  scenarios.  More  tests  need  to  be  performed  before 
concrete  conclusions  can  be  formulated. 

The  first  example  to  be  considered  involves  the  MIPA 
implementation  of  the  LO  demodulator  with  the  following 
parameters: 

CW  Interference  =  30dB 

WP  Interference  =  -30dB 

Polynomial  order  =  0 

Number  of  Intervals  =  8 

The  simulation  first  adds  the  interference  to  the  message. 
Figures  (5-10)  and  (5-11)  show  the  time  and  frequency  domain 
representations  of  message  plus  interference,  respectively. 
A  comparison  of  Fig.  (5-10)  with  Fig.  (5-1)  shows  how  much 
larger  the  interference  is  than  the  message.  The  simulator 


Figxura  5-10 


Figure  5-ii 


approximates  the  PDF  of  the  message  plus  interference  as  shown 
in  Fig.  (5-12)  .  From  the  PDF  the  Memory  less  Nonlinear 
Transform  (MNT)  is  computed,  as  shown  in  Fig.  (5-13)  .  The  MNT 
is  applied  to  the  signal  and  the  original  message  is 
extracted.  The  current  ANCP  simulation  plots  the  first  34 
bits  of  the  In-Phase  and  Quadrature  decoded  messages,  as  shown 
in  Figs.  (5-14)  and  (5-15)  respectively.  The  simulator  first 
plots  the  decoded  message  and  then  superimposes  the  original 
message  over  it.  Since  only  one  waveform  is  seen  in  Figs. 
(5-14)  and  (5-15) ,  the  decoded  message  matches  the  original 
perfectly  for  the  first  34  bits. 
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Figur*  5-12 


> 


Figure  5-14 
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Another  interesting  example  is: 

CW  Interference  =  -30  dB 

WB  Interference  =  30  dB 

Polynomial  order  =  2 

Number  of  intervals  =  2 

The  time  domain  of  the  message  plus  interference  is  shovm  in 
Fig.  (5-16) ,  while  Fig.  (5-17)  depicts  the  corresponding 
frequency 


Figure  5-16 


Figiure  5-17 


Figiire  5-18 


domain.  The  interesting  phenomena  in  the  corresponding  PDF  of 
Fig.  (5-18)  is  the  sharp  discontinuity  where  tne  two  intervals 
meet.  This  results  because  the  PDF  approximation  is  less 
accurate  at  the  edges  of  the  intervals,  so  the  two  least 
accurate  portions  of  the  approximation  are  set  directly  next 
to  each  other.  Since  the  MNT  is  computed  via  a  derivative,  a 
discontinuity  is  seen  in  the  resulting  MNT,  shown  in  Fig. 
(5-19). 
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Figure  5-19 


It  may  be  possible  to  remedy  this  problem  by  smooth  ig 
the  MIPA  to  the  PDF.  Examination  of  different  smoothing 
methods  is  an  objective  of  future  research. 

Even  with  the  marked  discontinuity  of  the  MNT,  the  final 
result  is  quite  close  to  the  original  message.  This  phenomena 
requires  further  examination.  In  Fig.  (5-20)  and  (5-21)  there 
are  only  three  errors.  Because  of  the  nature  of  the  MSK 
demodulation,  the  effects  of  these  errors  are  minimized.  The 
result  is  that  only  two  bits  out  of  128  are  in  error  at  the 
output  of  the  simulation. 

There  may  be  a  way  to  avoid  some  of  the  errors  that 
appear  in  Figs.  (5-20)  and  (5-21).  Notice  that  in  all  three 
error  bits  a  correct  decision  is  made  for  half  of  the  bit, 
while  an  incorrect  decision  is  made  for  the  remaining  half. 
This  arises  from  the  fact  that  the  quadrature  channel  lags  the 
in-phase  channel  by  90®.  Any  time  a  split  bit  occurs,  it  is 
certainly  an  error.  An  area  of  future  research  is  to  examine 
alternatives  for  deciding  which  value  to  choose  if  a  split  bit 
arises. 
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Figure  5~21 


Another  potential  method  for  avoiding  the  discontinuity 
problem  of  Fig.  (5-19)  is  to  use  just  one  interval  to 
approximate  the  entire  PDF.  The  problem  with  this  method  is 
it  lacks  the  flexible  response  of  the  multiple  interval 
approximations.  Added  flexibility  can  be  achieved  by 
increasing  the  polynomial  order,  but  this  results  in  more 
complex  calculations  and  the  need  for  more  precision. 
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Figiire  5-23 


An  example  of  a  fourth  order  MIPA  to  a  PDF  with  only  one 
interval  is  shown  in  Fig.  (5-22),  and  the  resulting  MNT  is 
shown  in  Fig.  (5-23) .  These  plots  are  in  response  to  15dB  of 
WB  interference.  There  were  no  errors  at  the  output  of  the 
simulator.  Un^il  an  alternate  algorithm  is  found  or  a  more 
powerful  simulator  is  used,  the  fourth  order  MIPA  simulation 
can  only  process  interference  levels  below  30  dB. 
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Both  the  MIPA  implementation  of  the  Locally  Optimal  (LO) 
demodulator  and  the  Higbie  approach  are  incorporated  into  the 
ANCP  simulation.  A  wide  range  of  CW  and  WB  interference 
levels  were  applied  to  various  orders  of  polynomial  and  number 
of  intervals.  The  interference  level  ranges  from  OdB  to  the 
limits  of  the  simulator.  At  high  interference  levels  (60dB 
for  order  0  and  2,  and  22dB  for  ord3r  4)  the  simulator  does 
not  have  enough  precision  to  compute  the  results  correctly. 
Near  and  below  zero  dB  the  large  J/S  assumption  of  the  LO 
demodulator  does  not  hold,  and  results  are  poor. 

Figure  (5-24)  plots  the  simulated  results  for  CW 
interference  ranging  from  0  to  60dB.  The  simulation  was 
performed  for  a  zero  order  MIPA  with  8  and  16  intervals,  a 
second  order  MIPA  with  4  and  8  bins,  and  the  Higbie  method 
with  8  bins. 


CW  Jammers 

Zero  and  Second  Order  MIPA 


Figure  5-24 
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It  appears  that  the  ANCP  simulation  performs  better  when 
the  number  of  intervals  is  small.  This  is  mainly  because 
there  is  a  limited  amount  of  samples.  When  the  PDF  is  divided 
into  many  intervals,  there  are  fewer  samples  in  each  interval. 
The  calculation  of  the  moments  is  probabalistic  in  nature  and 
thus  requires  a  large  number  of  samples  to  yield  adequate 
results.  Thus,  if  the  number  of  intervals  is  large,  the  MIPA 
will  be  based  on  inadequate  moments  and  will  produce 
unreliable  results.  This  problem  can  be  remedied  by 
generating  more  samples. 


Figure  (5-25)  shows  the  result  for  CW  interference 
combatted  by  a  fourth  order  MIPA,  both  with  two  intervals  and 
four  intervals.  This  results  of  this  configuration  of  the 
ANCP  simulator  are  not  as  good  as  described  above.  This  is  a 
result  of  several  factors,  including  limited  precision  of  the 
simulator,  limited  flexibility  of  the  MIPA  response,  and  lack 
of  sufficient  samples  to  achieve  convergence. 


CW  Jammers 

Fourth  Order  MIPA 


Toirfh  ordtr,  2  bin 
rnrth  cedtr;  4  bin 


Figure  5-25 
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Figures  (5-26)  and  (5-27)  report  the  results  of  WB 
interference  in  the  simulation.  Tiie  parameters  are  analogous 
to  the  CW  case  and  produce  similar  results. 


WB  Jammers 

Zero  ond  Second  Order  MIPA 


WB  Jamm«r  (d6) 


Figure  5-26 


WB  Jammers 

Fourth  Order  MIPA 


0  10  20  30 


WB  Jammer  LevW  (dB) 

Figure  5-27 
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6. 


SUMMARY  AMD  CONCLUSIONS 


The  major  focus  of  the  current  research  was  the 
implementation  of  the  MIPA  nonlinear  processor  in  the  current 
ANCP  simulation.  A  preliminary  examination  of  this 
implementation  was  performed  and  a  comparison  to  the  nonlinear 
processor  based  on  the  Higbie  approach  was  made.  Comparisons 
were  made  using  the  number  of  received  bits  in  error  as  a 
benchmark . 

Derivation  of  the  Locally  Optimal  ^LO)  maximum  likelihood 
function  was  required  because  of  an  error  in  the  Hazeltine 
report.  The  correct  LO  likelihood  function  is  given  by  Eq. 
(1-4)  . 


Solution  of  the  transition  matrix,  Q,  that  is  used  for 
transforming  normalized  MIPA  interval  coefficients  to  the  MIPA 
coefficients  for  the  conditional  PDF  approximation  was 
required.  A  discrepancy  in  the  Hazeltine  report  made  this 
development  necessary.  The  correct  transition  matrix  is  given 
by  Eq.  (3-28) . 

The  first  comparison  of  the  MIPA  implementation  of  the  LO 
maximum  likelihood  function  and  the  Higbie  approach  was 
theoretical.  It  is  shown  in  Appendix  B  that  the  optimum 
nonlinearities  corresponding  to  each  are  equivalent.  However, 
each  method  utilizes  this  nonlinearity  differently. 
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Once  the  theoretical  foundation  was  verified,  preliminary 
data  analysis  using  various  scenarios  was  performed  on  each 
simulator  implementation.  From  these  preliminary  results  it 
appears  that  the  performance  curves,  based  on  the  number  of 
received  bits  in  error,  for  most  tested  simulator 
configurations  have  similar  properties.  At  a  low  J/S  ratio 
errors  increase  since  the  large  interference  assumption  is  not 
valid.  As  the  J/S  ratio  increases,  the  performance  of  each 
method  is  better,  i.e.  the  number  of  bits  in  error  decreases. 
However,  errors  again  increase  as  the  J/S  ratio  increases 
beyond  the  range  of  the  simulator.  Loss  of  significant  bits, 
mathematical  roundoff,  and  arithmetic  overflow  are  some  of  the 
causes.  This  indicates  that  a  simulator  with  increased  power 
and  flexibility  will  be  necessary  in  the  future. 

Preliminary  data  for  the  addition  of  specific  types  of 
interference  was  gathered,  since  the  results  are  preliminary, 
global  conclusions  should  not  be  made  at  this  time.  More 
precise  data  must  be  gathered  before  stronger  claims  can  be 
presented.  For  example,  when  a  CW  inter ferer  is  added,  the 
preliminary  results  seem  to  indicate  that  a  histogram 
implementation  has  the  best  performance.  However,  as  the 
number  of  input  samples  increase  these  results  will  probably 
change.  Also,  if  a  time-varying  environment  is  approximated, 
this  would  affect  the  results  of  each  simulator  configuration. 
For  the  case  of  WB  noise,  it  is  not  possible  to  differentiate 
which  configuration  performs  best  from  the  preliminary  data. 
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It  will  be  necessary  to  examine  the  nature  and  relevance  of 
the  current  WB  noise  model.  Most  importantly,  the  preliminary 
results  illustrated  in  Fig.  (5-22)  through  Fig.  (5-25)  show 
that  each  nonlinear  processing  method  is  capable  of  countering 
the  effects  of  non-Gaussian  interference,  at  least  under 
specific  conditions. 

One  note  of  special  interest;  The  preliminary  results 
indicate  that  the  best  performance  for  each  simulator 
configuration  occurs  when  a  low  number  of  intervals,  or 
quantiles,  is  selected.  This  is  most  probably  a  result  of  the 
low  number  of  input  samples  used  by  the  simulation.  As  shown 
in  the  Hazeltine  report^^^,  with  input  samples  on  the  order  of 
10,000,  a  marked  improvement  can  expected  when  the  number  of 
intervals  is  increased.  Currently,  the  ANCP  simulation  uses 
1024  samples. 

The  present  research  and  the  summary  presented  here 
indicate  that  future  efforts  will  have  promising  results.  The 
future  objectives  of  this  research  are  presented  in  the  next 
section. 
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FUTURE  OBJECTIVES 


The  preliminary  work  on  this  project  suggests  that  the 
MIPA  and  Higbie  nonlinear  processors  have  the  potential  to 
effectively  mitigate  the  effects  of  non-Gaussian  jammers. 
However,  to  determine  which  processing  scheme  has  the  best 
overall  results  in  a  given  environment,  further  efforts  in 
this  area  are  necessary.  A  major  goal  of  the  future  research 
is  a  more  complete  study  of  various  nonlinear  processing 
techniques.  This  will  facilitate  the  determination  of  the 
optimal  processor.  Future  objectives  of  the  project  include: 

A.  Using  the  ANCP  simulation  to  extensively  verify  the 
results  of  the  Hazeltine  report.  Particular 
emphasis  will  be  placed  on  correcting  any 
discrepancies  found. 

B.  Determining  the  effect  of  varying  the  number  and 
size  of  the  intervals  in  the  MIPA  algorithm. 

C.  Determining  the  order  of  approximating  polynomial 
which  provides  optimal  performance  and  realistic 
complexity. 

D.  Determining  if  the  wide  band  noise  currently  being 
added  to  the  information  signal  is  Gaussian,  and  if 
not,  incorporating  a  Gaussian  wide  band  noise  model 

into  the  simulation. 
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E.  Determining  the  effects  of  adding  impulsive  noise 
and  impulse  excision. 

F.  Determining  the  effects  of  smoothing  on  the  MIPA 
algorithm. 

G.  Implementing  the  Hazeltine  memoryless  nonlinear 
transform  (MNT) ,  i.e.  the  optimum  nonlinear 
transform,  directly  using  the  MIPA  algorithm. 

H.  Examining  the  benefits  of  combining  the  MIPA 
algorithm  and  the  Higbie  approach  into  one  method. 

I.  Modifying  the  current  simulation  to  periodically 
update  the  probability  density  function  (PDF)  for 
the  MIPA  algorithm,  or  the  cumulative  distribution 
function  (CDF)  for  the  Higbie  approach.  This  will 
allow  investigation  into  the  area  of  training 
cycles,  that  is  time  necessary  for  a  processor  to 
learn  the  environment  before  actual  signal 
processing  begins.  Also,  examination  of  the  time 
varying  nature  of  the  signal  environment  will  be 
possible. 

J.  Examination  of  another  approach  developed  by  the 
Charles  Stark  Draper  Laboratory^®^ .  This  would 
include  verification  of  the  Draper  reports  and 
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implementation  of  the  Draper  Digital  Density 
Detector  in  the  ANCP  simulation.  In  addition,  an 
independent  simulation  would  be  developed  for 
examination  of  the  Draper  Robust  Digital  Adaptive 
Transceiver,  a  method  that  integrates  spatial, 
temporal,  and  amplitude  processing. 

The  anticipated  results  of  this  future  work  will 
illustrate  the  tradeoffs  of  each  nonlinear  processor  algorithm 
for  use  in  a  spread  spectrum  receiver.  With  this  knowledge, 
the  most  effective  processor  for  a  given  interference  scenario 
can  be  determined. 
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APPEMDIZ  A:  COMPARISON  BETWEEN  THE  LOCALLY  OPTIMAL  AMD 

HI6BIE  OPTIMUM  M0MLIMEARITIE8  FOR  TWO- 
DIMENSIONAL  SIGNALS 


The  Higbie  representation  of  a  two-dimensional  signal, 
after  seunpling,  is  given  by, 

(A-1) 

where:  is  the  in-phase  component, 

0^  is  the  quadrature  component. 

or  in  polar  form, 

(A-2) 

where:  A^=^iI+qI  is  the  magnitude, 

<t>;f=arctaji-^  is  the  phase. 

The  corresponding  Higbie  optimum  nonlinearity  can  be 
written  as, 

y={Ay+jA^e^**  (A-3) 
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with, 


a 

L>1 

K  \ 

i 

[  Pa^  ^^x'  ^  1 

[ 

l  j 

and, 


(A-4) 


(A-5) 


where :  (Aj^,4>^)  is  the  received  signal  probability 

density  function  (PDF)  in  polar  form, 
is  a  constant. 

To  show  the  equivalence  between  the  Higbie  and  the 
Locally  Optimal  nonlinearity,  it  is  necessary  to  derive  the 
Higbie  nonlinearity  for  the  received  signal  PDF  in  rectangular 
form.  This  PDF  can  be  derived  from  the  polar  form  PDF  using 
the  relation, 

Pa^  ^^x>  x)  ^^x^Pio  ^  > 


where:  i=i  (A^,  (t>^)  =Aj^xcos(J>^  and  g=g(Ajf,  (}> =A^xsin(l) . 
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Substituting  Eq.  (A-6)  into  Eq.  (A-4)  ,  the  result  is, 


2  Pxo(i»g) 


(A-7) 


Note  that, 


-^Pjo{i.g)--^Pjo(i,Q)  af 


(A-8) 


with  -^i-=cos4>„  and  ^2.  =  sin<(>  . 
dA^  ^ 


Therefore, 


As  ^  ^  Pro(^><3)  _  ^  PjgU.g) 

—  cos4)_x-ii - — r +Sin<j)xX 3 — — — — 

2  *  Pio{i,g)  PioU,g) 


(A-9) 


Similarly,  substituting  Eq.  (A-6)  into  Eq.  (A-5) ,  the  result 


2iA^j  A^xp^p(i,q) 


(A-10) 


Similar  to  the  develc  ment  of  Eq.  (A-9) ,  note  that. 


5;j7-Pie(-i/<3r)  =-^Pic>(-i»^)  3^'  *  (A-11) 


with  -^-  =  -A^sin<|)j^  and  -^^=A^co3^ 

o<p 


Therefore, 


-  (sin<j)jf)  X 


g) 

PTo(i><j) 


+COS(|>j^X 


Pro(i.q) 


(A-12) 


Now,  with  Ay  and  Ay  in  rectangular  coordinates  it  is  possible 
to  substitute  Eg.  (A-9)  and  Eg.  (A-12)  into  the  Eg.  (A-3)  . 
Noticing  that  e^^=cos4)+j  sin<j) ,  the  optimum  nonlinearity 
becomes , 


y=|A/+ jAjf}e-'**={Ay^+jA/}{cos(|>jf+j  sin4>j^}  (A-13) 


or, 

y={Ay^cos4>  ;^-Ay‘^s  in<|)  +  j  {Ay^sin<J>  ;f+Ay^cos4>  ( A-14 ) 


Now,  calculate  the  real  part  of  y; 


i?e{y}=Ay^cos<t> Ay  sin^ 


—  cos<t> 
2 


cos<J>  jjX 


■jrPjo^i.q) 

Pig(i.q) 


PioU.q) 


(A-15) 


{-sin<t)^) 


PigU.q) 


+  COS4>;jX 


■^PjgU.q) 

PioU,q) 
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(A-16) 


{cos^<t>_+sin^(J>  J  — --: — ^ — 

2  [  *  *  PioU,q) 


~  (cos<>jjSin4>jj-cos<j)j^in(|>j,) 


Pio^i.q) 


Similarly,  it  is  possible  to  calculate  the  imaginary  part  of 

y: 

Xm{y}=A/sin<|)  j^+a/cos<|) 
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-^PjO  (-2- / 

Iidy)=--:^  {cos<|)^sin<J>jf-cos<t>jjSin4>j^)  x-^= — — — ^ — 

2  Pjp  (2  ,  (J) 

(A-19) 


"aZ-Pxo  ( -i  /  <?) 
+  (cos2<|>--*-sin^4>  J  X — 2 — _ — j — 

Pjp(2,gr) 


Using  the  previous  trigonometric  identity: 


2 


(A-20) 


Therefore,  the  resulting  Higbie  optimum  nonlinearity  is: 


2  Pjg{i,q)  2  p^Q{i,q) 


(A-21) 


Compare  this  to  the  Locally  Optimal  maximum  likelihood 
function,  given  by: 

-^^IQi^kt^k)  .  (3-6) 
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It  can  be  seen  that  the  in-phase  and  quadrature  optimum 
nonlinearities  are  identical,  i.e.: 


Pigd.q)  figd.q) 


(A-22) 


-^PjQ{i,q)  ^  -^fjgh.q) 
Pig(i,q)  fjp{i,q) 


(A-23) 


However,  each  method  implements  these  optimum  nonlinearities 
differently. 

Next,  examine  the  case  when  the  received  signal  PDF  is 
assumed  to  have  bivariate  radial  symmetry.  The  received  PDF 
becomes : 

O<0<2n 

0  ,  elsewhere 


Then  Ay  becomes : 

Al\  A,  I  p,(A,) 

(A-25) 
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Multiplying  and  rearranging  terms: 


dA. 


■Pa  (AJ 


(A-26) 


Next ,  examine  Ay  : 


d  PaMx) 
aU  1  \  2:t 

2  I  ^xj  PaMx) 

2tc 


(A-27) 


Therefore,  the  Higbie  nonlinearity  of  Eq.  (A-3)  reduces  to; 


y~Ay  (cos4>x)  a/+j  (sin4>x)  a/ 


(A-28) 


y=. 


bos4)„x, 


dA 


■PaU^) 


Pa(^x) 


1 

A, 


■t-7sin4>  X 


dA/'^ 

Pa(AJ 


(A-29) 


Compare  this  to  the  Locally  Optimal  maximum  likelihood 
function,  given  by; 


K 

h-Y. 

k=l 


XCOS0X 


fe(r)  r 


+S;tjX  sin0x 


—  ^  (r) 
dr  ‘  ^  1 

fe(r)  r 


(1-5) 
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It  can  again  be  seen  that  the  in-phase  and  quadrature  optimum 
nonlinearities  are  identical,  i.e.: 


dA. 


di 

f^r)  r 


(A-30) 


As  stated  earlier,  each  method  implements  these  nonlinearities 
differently.  The  result  is  two  different  nonlinear  processors 
based  on  the  same  optimum  nonlinearity. 
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» 


APPENDIX  D:  DERIVATION  OF  THE  LOCALLY  OPTIMAL  MAXIMUM 

LIKELIHOOD  FUNCTION  FOR  COMPLEX  SIGNALS 


,  The  transmitted,  or  source,  signal  is  represented  by  a 

random  process  with  an  in-phase  component  R(t)  and  a 
quadrature  component  S(t) .  For  tha  received  signal,  the  in- 
phase  component  is  I(t)  and  the  quadrature  component  is  Q(t)  . 
Finally,  the  interference  is  characterized  by  an  in-phase 

» 

signal  X(t) ,  and  a  quadrature  signal  Y(t).  After  sampling  at 
the  receiver  input,  the  following  relation  results: 

,  I=R+X  0=s+y  (B-1) 


The  cumulative  distribution  function,  ,q)  /  is 

calculated  using  the  above  relations,  i.e.: 

FjQ(i,q) =P(I<i,Q<q) =P(R+X<1 ,S+Y<q)  (B-2) 


Therefore; 


•♦-00  +00  g-s  i-JT 


=  J  J  J  j  fjtsxr(^f^rX,y)  dxdydrds  (B-3) 


-00  >00  >00  >00 


where:  is  the  joint  PDF  of  the  random 

variables  r,s,x,  and  y. 
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If  the  transmitted  signal  is  independent  of  che  noise,  then: 


+0D  4-CD  i-iT 


=  |  J  j  j  fRs(r,s)  fxY{x,y)  dxdydrds  (B-4) 


-O  -CD  -CD  -CD 


where:  fjig(r,s)  is  the  joint  PDF  of  the  transmitted 

signal 

■fxr(^'y)  is  joint  PDF  of  the  interference. 

Let  HxY(x,y)  =jjfxy(x,y)  dxdy .  Substituting: 

FjqU.q)  =J s){[Hj„(i-r,  q-s)  -»)  ] 


(B-5) 


q-s)  -«*)  j)drds 


The  PDF  of  the  received  signal,  fjQ(i,q)  ,  is  obtained  by 
differentiation: 


32 


(B-6) 


fio(i,q)  =1  f  fns(T,s)  -^-{[H„{i-r,q-s)  -«>)  ] 

(B-7) 

-  q-s)  -»)  ]}drds 


8-64 


■fio(i'9)  =  j  Jfi,s(r,s){fxr(i-i‘/g-s) -0-0+0} dr ds  (B-S) 

••00—00 


Therefore; 


■fiQ(i»g)  =  j  (B-9) 


Next,  if  the  transmitted  signal  is  one  of  N  equiprobable 
transmitted  signal  sets,  then: 


1  w 

/^5(r,s)=_J]  Sir-rj)S{s-Sj) 


(B-IO) 


'J=l 


where:  and  Sj  are  one  of  N  possible  signal  sets, 

S(')  is  the  discrete  delta  function. 


The  resulting  received  signal  PDF  is: 


jf  >004-00 

fxo(i/g)=-^X  f  \  fxY(^-rj,q-Sj)  S  {r-rj)  S  (s-Sj)  drds 

J  =  1  ^00-09 


(B-11) 


or: 


^io(i»g)=4E  (B-12) 

"j=i 
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For  maximum  likelihood  detection,  it  is  desired  to 
maocimize  P(R=rj,S=Sj\I=i,Q=q)  .  This  is  equivalent  to  choosingr^ 
and  Sj ,  j=l->N,  to  maximize  f^Y{i-rj,q-Sj)  .  However,  if  we 
assume  that  K  samples  of  each  received  signal  are  used  to 
determine  the  maximum  probability,  then  the  transmitted  signal 
set  is  increased  to  an  N-ary  signal  set  in  which  each  signal 
is  a  vector  of  length  K.  This  can  be  written  as  I=R+x  and 
^R+Y.  Assuming  independent  and  identically  distributed 
samples,  then  the  received  signal  PDF  is  given  by: 


k~^jk>  ^k~^jk^  ( 


(B-13) 


Therefore,  for  maximum  likelihood  detection,  choose  fj  and 
j=l-*N,  to  maximize: 

K 

~W  ^XY^^k~^jki^kr^jk)  (B-14) 


Where:  K  is  the  length  of  a  signal  vector. 

To  simplify  calculations,  the  natural  logarithm  of  this 
function  is  used.  This  results  in  the  Globally  Optimal  (GO) 
maximum  likelihood  function: 


« 


< 


i 


i 
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Choose  fj  and  Sj,  to  maximize 


A»1 


(B-15) 


where:  qj^  is  the  sample  of  the  received  signal 

for  the  I  and  Q  channels  respectively, 

^kji  is  the  of  K  samples  used  to 

represent  the  transmitted  signal  pair. 

To  further  simplify  calculations,  a  Taylor  Series 
approximation  of  the  interference  PDF  is  used.  The  expansion 
is  around  the  point  since  this  value  is  known.  This 

approximation  is  useful  if  a  large  interference  to  signal 
ratio  is  assumed.  Therefore: 

^XY^^k~^jki^k~^jk)  ”^XY(^kf^k)  ^ -^^XY^^k'^k)  ^  (^k~^k) 

(B-16) 

*-^^XYi^kf^k)'^iyk~^k) 

Recalling  that 

^XY(^k~^jk>^k~^jk)  ^^XY^^k'^k)  ~^jk^-^^XY(^kf^k)  ~^jk^-^^XY^^kf^k) 

(B-17) 
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Substituting  this  result  into  the  GO  maximum  likelihood 
function; 


^  r  .  a  .  8  . 


(B-18) 


K 

k=l 


-g— fxr  -^^XY^^kt^k) 


(B-19) 


Since  we  are  comparing  fj,  Sj  to  maximize  Lj,  an  equivalent 
likelihood  function  is: 


L,-!  in^ 

X«1 


-^^XY^^kt^k)  -^^XY^^kf^k) 

^XY(^k>^k)  ^XY^^kt^k) 


(B-20) 


Using  the  approximation  ln(l-x)»-x; 


V-I 

x-i 


-^^XY^^k'^k)  -gfy^XYi^kf^k) 


^XY(^k*^k)  ^XYi^kf^k) 


(B-21) 


Finally,  since  a  high  interference  to  signal  ratio  was 
assumed,  /;fy(i,g)a/jQ(i,g)  and  R<' <X,  S<<Y,  This  results  in 
the  Locally  Optimal  (LO)  maximum  likelihood  function: 
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Choose  fj  and  Sj,  j=l-*N,  to  maximize 


X 

h-I. 


;c-i 


k'^k)  r^^IQ^^k'^k) 

“  f.^'rr.  rr."\  *^kj^ 


^io(^kf^k) 


^IQ(^ki^k) 


(B-22) 


Further  simplification  of  the  LO  maximum  likelihood  function 
results  when  complex  radial  symmetry  of  the  received  signal 
PDF  is  assumed.  Under  this  assumption: 


fiQU,q)  = 


fair) 
2nr  ' 


0<6<2Tr 

elsewhere 


(B-23) 


where:  fiQ(‘  )  is  the  received  signal  PDF, 

f^(r)  is  the  received  envelope  PDF, 
r=\Ji^^q^  is  the  received  magnitude, 

0=arctan5  is  the  received  phase  angle. 

To  derive  the  expression  for  the  LO  likelihood  function,  first 
the  expressions  for  the  partial  derivatives  must  be  shown: 


since 


dr 

■31 


=cos6 : 


-^fiQ{i,q)  =cos0x_^.^fii2. 
•'ll  dr  2rrr 


(B-25) 


Similarly: 


d 

-3q 


£iQU,q) 


a  _  d  ^ar 

"Sq  2irr  ~3r  2irr  '3q 


(B-26) 


Since 


4^=sin0 : 


d 


,g)  =sin0x_z — z — 
■^g  xo'  2nr 


(B-27) 


Next: 


d  -%(r)  ^  _  re(^)  (B-28) 

dr  2irr  27rr  2nr^ 


Substituting  these  results  into  the  expression  for  the  LO 
maximum  likelihood  function  of  Eq.  (B-22),  the  result  is: 


Choose  fj  and  S^,  j=l-*N,  to  maximize 


K 


|r;tjXCOS0X 


1 

fe(^)  ^ 


+S;jjXsin0x 


dr  ® '  '  ^  1 
fe(^)  ^ 


(B-29) 
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ABSTRACT 

Atomic  layer  epitaxy  is  a  technique  applicable  in  certain 
chemical  vapor  depositions  which  deposits  one  atonic  atom  layer 
at  a  time  upon  the  substrate.  This  technique  has  been  used  to 
deposit  thin  films  of  gallium  arsenide,  but  has  not  been  reported 
in  the  literature  to  our  knowledge  to  have  been  used  to  deposit 
any  titanium  compounds.  Because  of  the  crystal  structure  of 
titanium  carbide,  it  may  be  possible  that  atomic  layer  epitaxy 
might  be  achieved  under  favorable  conditions.  If  successful, 
precisely  controlled  film  thicknesses  could  be  produced  with 
electronic,  mechanical,  and  corrosion  resistant  applications. 

A  system  has  been  designed  and  equipment  ordered  to  con¬ 
struct  a  thin  film  deposition  system  to  test  this  hypothesis. 

Funds  from  a  current  NASA  grant  have  also  been  used  in  addition 
to  the  funds  for  this  project.  Two  undergraduate  senior  project 
papers  resulted  from  this  effort.  Work  to  assemble  and  test  the 
system  is  still  ongoing  at  the  present,  and  so  no  concrete  results 
are  available  now.  Whatever  results  are  obtained  will  be  reported 
to  the  Air  Force  as  soon  as  they  are  analyzed. 
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I.  INTRODUCTION: 


During  a  ten-week  period  in  the  summer  of  1988,  I  participated 
in  the  1988  USAF-UES  SUMMER  FACULTY  RESEARCH  PROGRAM /GRADUATE  STUDENT  • 
RESEARCH  PROGRAM.  I  was  selected  to  work  at  the  Rome  Air  Development 
Center  Solid  State  Sciences  Directorate  of  the  Electromagnetic  Materi¬ 
als  Technology  Division  at  Hanscom  AFB,  Bedford,  MA,  near  Boston.  My  « 
assignment,  as  jointly  agreed  upon  with  my  technical  focal  point  there, 
was  to  assist  in  the  design,  fabrication,  construction,  and  assembly 
of  a  chemical  vapor  deposition  system  to  deposit  titanium  nitride  thin  < 
films.  The  particular  .nphasis  was  to  attempt  to  achieve  films  deposi¬ 
ted  one  atom  layer  at  a  time.  This  technique  is  known  as  atomic  layer 
epitaxy.  If  successful  atomic  layer  epitaxy  can  be  used  to  deposit  ' 

films  of  exactly  controlled  thickness  having  applications  in  micro¬ 
electronics,  electrooptics,  high  strength  materials,  and  corrosion 
resistance.  The  construction  of  this  system  was  not  complete  when  my  ’ 
research  period  was  over,  and  was  left  to  base  personnel  to  complete 
and  operate. 

Later,  I  was  successful  in  my  application  for  a  mini-grant  fol- 
low-on  project  to  develop  a  similar  atomic  layer  epitaxy  system  on  the 
campus  of  Prairie  View  A&M  University.  The  focus  of  this  application 
was  titanium  carbide,  but  other  compounds  of  titanium  and  other  ele¬ 
ments  were  also  possible  candidates. 

This  report  focuses  on  the  efforts  to  design  and  construct  such 
a  system.  Funds  from  NASA,  obtained  in  a  separate  proposal,  were  also 
used  in  developing  the  system.  That  NASA  grant  still  continues. 
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» 

» 

II.  OBJECTIVES  OF  THE  RESEARCH  EFFORT: 

» 

Gallium  arsenide  (GaAs)  has  frequently  been  deposited  using 
»  atomic  layer  epitaxy,  as  have  other  II-VI  and  III-V  semiconductor 

compounds.  GaAs  has  the  zincblende  crystal  structure,  as  shown 
in  Figure  1,  and  it  is  easy  to  visualize  how  a  layer  of  Ga  could 
•  be  followed  by  a  layer  of  As,  and  so  forth,  until  a  thin  film 

of  several  atom  layers  of  each  element  was  deposited  to  make  up 
the  compound.  The  structure  of  titanium  carbide,  where  the  small 
►  carbon  atom  is  at  the  center  of  an  octahedral  structure,  is  shown 

in  Figure  2.  Although  it  is  difficult  to  imagine  how  a  single 
layer  of  titanium  atoms  could  be  followed  by  a  single  layer  of 
»  carbon  atoms  to  make  up  the  structure,  it  is  precisely  this  that 

we  are  attempting  to  accomplish. 

^  In  short,  it  is  our  objective  to  design,  construct,  and 

assemble  the  components  necessary  to  attempt  the  atomic  layer 
epitaxial  deposition  of  titanium  carbide  (TiC) .  We  anticipate 
that  this  can  best  be  done  at  a  substrate  temperature  in  the 
I  vicinity  of  750°C  and  at  a  pressure  of  approximately  40  torr. 

The  source  of  the  titanium  is  to  be  titanium  tetrachloride 
(TiCl^),  which  is  a  liquid  at  room  temperature.  The  titanium 
chloride  will  be  vaporized  by  and  carried  to  the  reactor  by  a 
stream  of  hydrogen.  Vaporization  will  occur  in  a  glass  bubbler. 
The  carbon  source  is  to  be  carbon  tetrachloride  (CCl^)  liquid, 
also  vaporized  by  hydrogen  in  a  bubbler.  Hydrogen  serves  as  the 
carrier  gas,  and  also  serves  as  a  reactant  in  the  reactions 
TiCl^  +  2  H2 - >  Ti  +  4  HCl 
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< 


CCI4  +  2  - >  C  +  4  HCl 

Then  the  titanium  carbide  is  formed  on  the  substrate  surface  by 

Ti  +  c  - >  Tic 

The  key  to  having  these  reactions  proceed  one  atom  layer  at 
a  time  rests  with  Lhe  reaction  mechanism,  ^f  the  ease  of  vapori¬ 
zation  of  a  complex  carrying  a  titanium  atom  is  greater  than  the 
ease  of  forming  a  titanium  metal  layer  on  top  of  Tic,  and  the  same 
is  similarly  true  for  a  carbon-containing  complex,  then  there  is  a 
good  chance  that  atomic  layer  epitaxy  (ALE)  will  occur.  If  ALE 
does  occur,  then  precise  control  of  layer  thickness  can  be  obtained, 
in  fact  precise  control  to  the  nearest  single  atom  layer. 

The  procedure  will  be  as  follows: 

1)  Evacuate  the  chamber  containing  the  surface-cleaned  substrate, 
and  heat  the  substrate  to  the  desired  deposition  temperature. 

2)  Flush  the  system  with  pure  hydrogen,  and  adjust  the  system 
pressure  to  the  desired  value. 

3)  Allow  the  gas  flow  containing  Ti  to  enter  the  chamber,  and 
contact  with  the  substrate  occurs  for  a  few  measured  seconds. 

4)  Flush  the  chamber  with  hydrogen  to  remove  nearly  all  of  the  Ti- 
containing  gas,  and  desorb  any  unreacted  Ti-complex  from  the 
substrate.  This  may  take  many  multiples  of  the  time  period  of 
step  3 . 

5)  Admit  the  c-containing  gas  for  a  measured  time  interval. 

6)  Repeat  step  4  for  the  C-containing  gas. 

7)  Return  to  step  l  until  the  desired  number  of  monolayers  are 
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deposited. 


III.  LITERATURE  REVIEW: 

The  literatxire  review  of  the  proposal  is  included  as  an 
appendix,  since  it  would  serve  no  purpose  to  repeat  it  here. 

Most  of  it  serves  to  chronicle  the  chemical  vapor  deposition 
of  electrically  conductive  titanium  compounds  —  titaniiun  boride, 
titanium  carbide,  and  titanium  nitride.  None  of  these  compounds 
had  been  reported  in  the  literature  as  deposited  by  atomic  layer 
epitaxy  as  of  January,  1989,  and  the  same  still  holds  true  today. 

What  will  be  discussed  here  is  the  additional  literature 
since  the  original  proposal.  All  of  these  relate  to  titanium  nitride, 
except  for  the  senior  papers  of  the  two  undergraduate  students  who 
have  worked  on  the  project  reported  upon  here. 

lanno,  et  al.  (1)  used  plasma  enhancement  to  deposit  titanium 
nitride  using  nitrogen  as  the  nitrogen  source.  They  used  a  rather 
high  residual  pressure  of  1.1  torr.  They  were  able  to  achieve 
reasonably  low  resistivity  thin  films  at  substrate  temperatures 
as  low  as  500°C.  They  also  investigated  changing  flow  rates 
of  the  reactants,  and  found  that  nitrogen  had  to  be  supplied  in 
excess  of  stoichiometric  amounts  to  reduce  the  chlorine  content 
of  the  films.  The  chlorine  content  is  mainly  responsible  for  any 
increase  in  resistivity.  They  proposed  a  mechanism  where  TiCl3 
was  the  reactive  species.  Several  uses  of  these  films  as  metal¬ 
lization  barriers  in  microelectronic  applications  were  given. 


9-7 


Ishihara  et  al.  (2)  produce  TiN  films  using  a  metalorganic 
source  for  the  titanium  —  TiCNCCH^) 2) 4 •  In  thij 

case,  ammonia  was  used  as  the  nitrogen  source,  and  the  pressure 
was  0.3  torr.  The  temperature  range  investigated  was  300-580°C, 
but  the  golden  TiN  color  did  not  develop  until  580°C,  where  the 
lowest  resistivity  films  were  obtained.  Step  coverage  was  good, 
approximately  50%. 

Yokoyama,  et  al.  (3)  use  the  same  chemical  system  as  Ishihara, 
et  al.,  but  at  a  pressure  of  only  0.188  torr.  They  developed  a  new 
heater  system  capable  of  800°C,  but  carried  out  depositions  in 
the  range  500-700°C.  In  this  case,  step  coverage  was  almost 
perfect,  even  filling  completely  some  large  aspect  ratio,  small 
diameter  holes.  They  also  carried  out  hydrogen  annealing  of  the 
deposited  films.  This  technique  dropped  chlorine  content  from  the 
small  deposited  level  of  5.8%  to  below  1%,  and  resistivities 
also  dropped.  The  authors  suspect  that  the  chlorine  is  removed 
as  ammonium  chloride,  which  explains  why  excess  nitrogen  is 
beneficial.  Their  500°C  deposit  is  dark  red-brown,  but  changes 
to  the  typical  golden  color  upon  annealing.  The  deposit  is  highly 
columnar. 

Michelle  Boyard  (4)  and  Caryn  Davis  (5)  each  wrote  Senior 
Design  Project  Reports  based  on  the  work  they  accomplished  in 
designing  portions  of  the  hardware  for  our  deposition  system. 

Miss  Boyard 's  report  concentrates  on  the  temperature  control  and 
the  plasma  control  systems,  including  the  heaters  and  the  plasma 
system's  matching  network.  Plasma  may  be  used  in  this  work,  and 


will  definitely  be  used  for  NASA's  amorphous  silicon  deposition 
work.  Miss  Davis'  report  discusses  the  design  of  the  gas  supply 
system  and  the  vacuum  system. 


IV.  DESIGN  OF  THE  HARDWARE: 

The  main  equipment  item  is  the  deposition  chamber,  depicted 
schematically  in  Figure  3.  It  consists  of  a  vycor  glass  cylinder 
13  inches  in  diameter  and  about  16  inches  high.  The  chamber  fits 
over  a  stainless  steel  support  ring,  through  which  electrical 
and  gas  feedthroughs  pass.  Vacuum  is  applied  from  below.  The 
substrate  is  mounted  upon  a  stainless  steel  plate  insulated  from 
below  by  a  ceramic,  and  heated  by  a  resistance  plate  heater. 

The  stainless  steel  can  serve  as  an  electrode  should  plasma 
depositions  be  required,  and  a  heated  top  electrode  is  also 
present. 

Figure  4  shows  the  schematic  diagram  of  the  gas  supply 
system.  Both  feed  gases  are  supplied  in  a  hydrogen  mixture 
which  passes  through  bubblers.  The  reactive  gases  are  set  up 
in  a  "vent-and-run"  fashion,  which  means  that  the  flow  rate 
through  the  bubblers  is  maintained  constant.  Each  needle  valve 
is  adjusted  so  that  the  path  of  the  gas  straight  to  the  vacuum 
system  has  exactly  the  same  flow  resistance  as  the  path  of  the 
gas  through  the  ALE  chamber.  Nitrogen  is  available  as  a  purge 
gas,  since  it  is  anticipated  that  ammonia  would  be  used  as  a 
nitrogen  source  should  TiN  ever  be  deposited. 

Figure  5  shows  the  overall  arrangement  of  equipment.  Mass 
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flow  controllers  maintain  critical  gas  flow  rates  constant.  A 
Tylan  flow  sequencer  can  be  programmed  to  maintain  the  complicated 
gas  flow  sec[uence  schedule  required  by  the  atomic  layer  epitaxy 
procedure . 

Table  1  shows  most  of  the  equipment  specifications.  A  silane 
gas  supply  system  will  also  be  added  to  the  system  shown,  since 
NASA  is  sponsoring  a  concurrent  project  to  deposit  amorphous  silicon 
thin  films.  Table  2  shows  costs  of  some  of  the  Table  1  equipment, 
as  well  as  model  numbers  and  suppliers.  No  particular  equipment 
recommendations  can  be  made,  since  the  system  has  not  yet  been 
completely  assembled  or  operated. 


V.  RESULTS: 

As  mentioned  before,  the  system  is  still  under  construction 
and  assembly,  so  no  results  or  recommendations  can  be  made  at 
this  time. 
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llolleck  (5,6)  offers  review  papers  on  selecting 
materials  useful  for  hard,  wear  resistant  coatings.  he 
einpliasizes  that  substrate  and  coating  must  have  similar 
tliermal  expansion  coefficients,  and  lists  titanium 
nitrides,  borides  and  carbides  as  candidate  materials. 
Knotek  (10)  et  al.  investigate  sputtered  films  of 
titanium  nitrides,  carbides  and  mixtures  for  hardness. 
Tsakalos  (24)  shows  that  supe r lattices  with  wave¬ 
lengths  of  the  order  of  about  one  nanometer  sliow 
improved  elastic  moduli,  wliicli  show  maxima  at  certain 
wavelengths.  Takaliashi  and  Kniuiya  (21)  used  CVl) 
to  deposit  Ti-V-li  films. 

Ilelmerscon  et  al.  (4)  show  a  hardness  maximum  in 
the  TiN/VN  s t r a i ned- 1 aye r  superiattice  system  at  a 
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wavelength  of  about  6  nm.  They  also 
hardness  when  the  ratio  of  TiN  layer 
layer  thickness  is  about  0.35.  They 
technique  to  grow  the  films  at  about 


show  a  maximum  in 
thickness  to  total 
used  a  sputtering 
750°C. 


Kim  and  Chun  (9)  used  CVD  to  deposit  TiN  onto  TiC 
at  about  900-1150°C  and  Itoh  (7)  deposited  alternating 
TiN  and  TiB2  layers  at  the  same  temperatures.  High¬ 
hardness,  adherent  coatings  were  achieved.  Moore  et 
al.  (13)  produced  four  sequential  layers  on  a  WC 
cutting  tool  using  CVD,  including  TiN  and  TiC.  Kim 
et  al .  (8)  discuss  CVD  of  TiC  alone. 


Kurtz  and  Gordon  (11)  discuss  the  use 
electronic  and  several  other  applications, 
films  from  400-700°C,  with  600®C  being  the 
for  best  results.  They  also  test  metal-or 
sources . 


of  TiN  in 
and  deposit 
lower  limit 
ganic  gas 


Titanium  boride  films  are  complicated  by  the  fact 
that  the  boron  component  can  be  lost  by  reaction  with 
residual  water  vapor  and  subsequent  volatilization,  as 
shown  by  Feldman  et  al.  (3).  The  result  is  often 
uncertain  stoichiometry.  Lynch  et  al.  (12)  showed  this 
was  a  problem  even  in  bulk  single  crystals. 

Work  on  the  TiB2  system  dates  back  to  least  as 
far  as  1949,  when  Ehrlich  (2)  prepared  several  titanium 
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compounds  by  sintering.  Shappiro  et  al.  discuss  some 
of  the  electronic  applications  for  TiB2.  Caputo  et  al. 
(1)  and  Motojima  et  al.  (14)  discuss  erosion  resistance 
and  hardness  of  CVD  coatings  of  TiB^.  Pierson  et  al. 
(17,10)  used  CVD  to  produce  TiB2  coatings  on  graphite 
from  600-925°C.  Takahashi  and  Itoh  (20)  used  an 
ultrasonic  field  during  CVD  to  decrease  the  grain  size 
of  the  deposit.  Williams  (27)  uses  a  plasma  to  lower 
the  CVD  deposition  temperature  of  TiB2  films. 

Ozeki  et  al.  (16)  and  Tischler  and  Bedair  (22,23) 
give  explanations  of  why  monolayers  can  be  deposited 
by  ALE  in  the  GaAs  system  and  the  InAs  and  In-Ga-As 
systems.  Nelson  (15)  describes  ALE  applied  tp  ZnS  and 
ZnSe . 

Wang  et  al.  (25)  show  how  Ti02  smoke  particles 
allow  visualization  of  flow  patterns,  and  assist  in 
the  spatial  design  of  CVD  reactors. 
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TABLE  1  .  continued 


Vacuum  valve 

D 

-  1  in 

• 

body-SS 
copper  0-ring 

Solenoid  valves 

D 

“  .25 

in . 

normally 

closed 

body-SS 

Needle  valves 

D 

-  .25 

in . 

0  -  .055  in., 

F  -  2510  cm-’/s 

body-SS 

seals-SS 

Plug  valves 

U 

-  .25 

in . 

»:.r  -  * 

body-SS 

seals-SS 

VCR  connccLions 

U 

-  .25 

In . 

body-SS 

seals-SS 

Elbows 

D 

=  .25 

i  n . 

all  SS 

Unions 

D 

=•  .25 

i  n . 

all  SS 

Tees 

D 

>  .25 

i  n . 

-6 

all  SS 

In-line  filter 

D  -,5x10  m 

Body-SS 

P 

filter 
medium  - 
mesh 
element 

Cliock  valves 

Pq  -  20  psig 

Body-SS 
0-ring- 
vi  ton 

Vacuum  pump  oil 

V  -  2 . 1  L 

P 

Fombl  in 

Pressure  gauge 

=  .25  in . 

P  •>  0-760  Torr 
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Table  2.  Price  List 


Model,  type  or  Component  Quantity  Manufacturer  Total  Price 

part  no. _ _ _ 


SS-404-1 

Ferrules  (back) 

50 

Nupro 

204.00 

SS-403-1 

Ferrules  ( front ) 

50 

Nupro 

30.50 

Tymgard 

Flow  seqtiencer 
and  timer 

1 

Tylaa 

3430.00 

lA 

II2  gas  cylinder 

1 

Matheson 

$100.00 

3104-350 

fl2  regulator 

1 

Matheson 

$206.00 

SS-4F- . 5 

In-Line  Filter 

2 

Nupro 

$72.00 

FC-260 

Mass  Flow 
controller 

4 

Tylan 

$3580.00 

7532-10 

Midget  bubbler 

2 

Ace  Glass 

$70.00 

SS-4DMW 

Needle  Valve 

2 

Nupro 

$1738.00 

lA 

n2  gas  cylinder 

1 

Matheson 

$35.00 

0-500 

N2  regulator 

1 

Matheson 

$196.00 

S-4H 

Plug  Valve 

20 

Nupro 

$1738.00 

25-1009SW-02L 

Pressure  Gauge 

1 

Mye  r 

$48.51 

7x 

Sill4  gas  cylinder 

1 

Matheson. 

$147.00 

3601-350 

SiH4  regulator 

1 

Matheson 

$517.00 

SS-400-6 

Union 

20 

Swagelok 

$143.00 

SS-400-9 

Union  Elbow 

20 

Swagelok 

$204.00 

SS-400-3 

Union  '''ee 

40 

Swagelok 

$602.00 

Fombl in 

Vacuum  Pump  Oil 

1 

Lesker 

$075.00 

304-24VFD(; 

Vacuum  Valvc 

1 

Nupro 

$525.30 

Note:  Swagelok  and  Niipro  parts  were  purchased 

Nortli  Houston  Valve  and  Fitting  Company. 

through  local 

supplier 
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I.  Introduction 


Pattern  Recognition  is  an  important  problem  for  both  military  and  civil  appli¬ 
cations.  The  difficult  part  of  the  problem  is  that  reaJ-world  patterns  axe  generadly 
transformed  in  many  ways  before  being  presented  for  identification.  The  ability  to 
recognize  transformed  patterns  is  called  Invarimt  Pattern  Recognition. 

Neural  Network  architecture  is  a  relatively  new  technique.  (Actually,  the  tech¬ 
nique  is  fairly  old,  but  it  was  only  recently  that  its  value  has  become  widely  rec¬ 
ognized.)  Neural  Networks  exhibit  good  behavior  with  respect  to  incomplete  data, 
they  offer  the  speed  advantages  of  parallel  implementation,  and  they  have  “graceful 
degradation”;  this  means  that  a  Neured  system  can  continue  to  perform  well  in  the 
event  of  component  failures,  rather  than  just  stopping.  These  traits  suggest  that 
Neural  Networks  may  be  important  parts  of  various  pattern  recognition  systems. 
This  was  the  general  focus  of  this  research  effort. 

The  particular  area  of  application  was  image  recognition.  The  images  used  were 
gray-sczde  images  of  aircraft  planforms,  simulating  overhead  imagery.  The  system 
was  ‘trained’  on  these  images,  and  tested  on  images  which  had  been  transformed 
by  translation,  scaling,  occlusion,  small  in-plane  rotations,  and  additive  Gaussian 
white  noise  of  fairly  ;gh  power.  The  objective  was  to  design  a  Neural  Network 
beised  system  which  could  succesfully  recognize  the  tremsformed  images. 

II.  The  Problem 

An  image  is  a  finite  energy  (ie,  L^)  function  /:  R^  — ♦  R  .  The  value  of  /(x,y) 
is  the  intensity  of  the  image  at  the  point  (x,  y).  It  can  be  assumed  that  the  value 
of  /  is  0  outside  of  a  bounded  set,  ie,  there  are  no  images  of  infinite  extent. 

Reseeu-ch  into  the  Human  Vision  System  (HVS)  (see  [KFAW]  for  recent  ideas) 
indicates  that  it  uses  several  scales  in  the  identification  process  (the  system  is, 
of  course,  much  more  complicated  than  this,  although  multiscale  is  an  important 
strategy).  At  the  largest  scale  it  determines  gross  attributes  (house,  airplane,  bird, 
Chinese  character)  of  the  objects  which  it  senses.  For  example,  the  large  scale  view 
of  an  object  in  a  overhead  image  might  determine  that  it  is  a  wide-bodied  airplane; 
a  slightly  smaJler  scale  view  might  determine  that  it  has  two  wing-mounted  engines, 
thus  reducing  the  possible  solutions  to  Airbus  A320  or  Boeing  767;  and  a  smaller 
scale  view  might  detect  the  presence  of  flap  track  fmrings,  which  are  not  present  on 
the  767.  (Of  course,  at  very  small  scale  one  might  be  able  to  read  the  manufacturer’s 
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data  plate.) 

Kosslyn  et.  al.  call  some  of  these  small-scale  features  “trigger  features'’.  The 
presence  of  a  “trigger  feature”  suffices  to  identify  an  object  despite  gross  treinsfor- 
mations.  A  musicail  analogy  might  be  the  ability  to  name  certain  popular  songs 
after  hearing  just  a  few  notes. 

It  seems  reasonable  to  design  an  automatic  identification  system  which  mimics 
this  approach  (although,  as  will  be  discussed  later,  this  is  not  the  only  good  ap¬ 
proach).  The  problem  is  to  use  the  various  scales  in  a  systematic  and  manageable 
way,  and  to  determine  the  appropriate  “trigger  features” . 

III.  Design  Considerations 

The  original  plan  for  this  effort  was  based  on  a  Neural  Network  architecture 
called  the  Neocognitron.  The  neocognitron  is  a  multiple-layer  feed  forward  network. 
Each  layer  consists  of  two  types  of  units.  One  type  of  unit  responds  to  a  low-level 
feature  in  a  certain  region  of  the  previous  layer,  while  the  second  type  ‘integrates’ 
responses  from  the  first  type  to  form  a  part  of  a  feature  at  this  level.  This  design 
accomplishes  the  multiple  sc2de  analysis  described  above,  although  there  is  no  strict 
analog  of  the  “trigger  feature”.  Fukushima  [F]  developed  this  architecture  and  was 
successful  using  it  to  recognize  hzmdwritten  numerals. 

The  proposal  was  to  enhance  this  architecture  through  preprocessing  of  the 
images  and  other  means  to  enable  it  to  handle  larger,  gray-scale  images,  rather 
than  outlines.  The  enhancements  developed,  which  are  described  below,  were  such 
that  the  only  vestige  of  the  neocognitron  in  the  system  is  the  presence  of  a  feed¬ 
forward  network.  The  response  to  low  level  features  has  been  replaced  by  the  Gabor 
transform,  which  filters  for  spatiail  frequency  and  orientation;  the  “integration”  of 
low  level  features  is  bypassed  using  entropy  (low  entropy  indicates  high  information 
content),  which  selects  the  characteristic  features  in  the  image;  and  the  slowly- 
trained  neocognitron  has  been  replawred  by  the  Probability  NeuraJ  Network,  which 
features  instantaneous  training.  The  resulting  system  is  able  to  work  easily  with 
gray  scade  images  but  is  more  flexible  and  easier  to  train. 

IV.  Implementation  Considerations 

All  of  the  algorithms  discussed  in  this  report  were  implemented  in  C  programs 
which  simulate  parallelism.  Prograuns  were  primarily  developed  on  the  Macintosh 
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Ilci  computer  which  was  purchased  with  the  grant  funding.  The  programs  have 
been  successfully  ported  to  a  Sun  workstation  and  IBM-PC  compatibles. 

Several  relatively  inexpensive  parallel  machines  have  been  announced  since 
this  effort  began.  These  include  the  MasPar  and  DAP  machines.  It  would  be  very 
interesting  to  test  these  algorithms  on  such  a  machine,  especiedly  in  view  of  the 
parallel  nature  of  the  edgorithms. 

It  had  been  proposed  to  test  these  algorithms  with  “real”  images,  but  security 
and  funding  considerations  prevented  this. 

V.  The  Design 

The  first  step  in  the  processing  of  the  images  is  the  “Gabor  transform”.  The 
Gabor  trzmsform  is  analogous  to  the  Fourier  transform  in  the  sense  that  it  forms  a 
representation  of  a  function  such  as  /(x,y).  Given  vectors  k  cind  i,  define 

G(k,x)  =  exp(-||  k  f  II  I  ||V8.77)exp(-zfc  •  x) 

where  ||  x  j]  denotes  the  magnitude  of  the  vector  i,  =  —  1,  and  8.77  is  a  parameter 
controlling  window  size.  The  first  factor  is  a  Gaussian;  the  second  is  sensitive  to 
frequency  and  orientation.  One  may  consider  G  to  be  the  transfer  function  of  a  2-D 
filter. 

The  transform  consists  of  a  convolution  of  I  with  G(fc,  x)  for  various  k  at  points 
on  a  grid  superimposed  on  the  image.  This  is  analagous  to  filtering  with  a  bank  of 
filters.  The  filter  responses  form  the  Gabor  transform. 

By  proper  choice  of  the  various  k,  we  czm  search  for  various  features.  If  ||  k  ||  is 
large,  the  filter  will  have  a  high  response  in  the  presence  of  correctly  oriented  high- 
frequency  features;  if  ||  k  ||  is  small,  the  response  is  high  only  in  the  presence  of 
low-frequency  features  with  the  correct  orientation.  The  value  8.77  sets  the  window 
size  to  two  wavelengths.  See  [D]  or  [FT]  for  more  on  this  technique. 

Notice  that  the  filtering  is  local  with  respect  to  the  image,  due  to  the  limited 
Gaussian  window,  so  a  parallel  machine  could  filter  simultaneously  at  all  points  of 
the  superimposed  grid. 

Define  a  characteristic  feature  to  be  the  presence  of  two  or  more  low-level  fea¬ 
tures  at  the  same  scale.  Conceptuzilly,  a  characteristic  feature  is  a  corner  or  the 
presence  of  several  edges  in  a  neighborhood.  The  S-cells  of  Fukushima’s  neocog- 
nitron  essentially  respond  to  chzu-acteristic  features  as  defined  here.  Characteristic 
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features  sire  used  to  classify  objects  in  images.  They  are  scale  and  position  invariant, 
and  can  be  made  rotation  invariant  through  normalization. 

The  system  needs  to  select  “important”  characteristic  features  from  each  im¬ 
age  which  it  is  supposed  to  know.  This  is  done  using  uncertainty  in  the  sense  of 
Information  Theory  ([Sh]).  First,  the  filter  responses  are  quantized.  Let  pj  be  the 
observed  probability  of  response  j  in  the  training  set.  Then  the  uncertainty  at  a 
given  point  on  the  image  is 

where  the  sum  is  taken  over  all  of  the  Gabor  filter  responses  at  each  orientation  at 
the  grid  point  in  question.  It  is  possible  to  calculate  H  in  parallel.  A  low  value  of  H 
at  a  certain  location  indicates  that  there  is  something  “interesting”  or  “informative” 
there.  The  characteristic  features  are  those  with  the  lowest  value  of  H ;  the  lowest 
r  values  are  taken,  where  r  is  a  user  specified  parameter. 

Training  of  the  system  consists  of  performing  the  Gabor  transforms  on  all  of 
the  images,  selecting  the  characteristic  features  from  these  images,  and  training 
some  kind  of  Content  Addressable  Memory  (CAM)  to  recognize  combinations  of 
features.  (There  is  no  reason  to  believe  that  any  give  characteristic  features  will 
be  present  in  only  one  image.)  The  CAM  selected  was  the  Probability  Neural 
Network  (PNN).  A  PNN  is  topologically  similar  to  a  back-propogation  network, 
but  it  is  much  more  mathematically  sophisticated.  There  is  one  hidden  node  for 
each  training  exemplar,  m  well  as  one  for  each  category.  The  weights  from  the 
input  nodes  to  the  hidden  node  representing  the  exemplar  are  exactly  the  inputs 
for  the  exemplar;  thus,  training  is  immediate,  since  there  is  no  need  to  wait  for  some 
training  process  to  converge.  The  transfer  frmction  of  the  exemplar  nodes  is  not  the 
familiar  sigmoid  rather,  it  is  chosen  to  enable  each  output  to  represent  the 

probability  density  for  a  category,  evaluated  at  the  input  in  question.  This  gives 
good  figures-of-merit  for  the  classifications  inferred.  See  [S]  for  more  details. 

VI.  Testing  the  System 

The  system  outlined  above  was  simulated  with  C  programs.  The  first  image 
set  consisted  of  four  32  x  32  pixel  hand-coded  binary  images  representing  aircraft 
planforms.  The  choices  of  k  gave  4  orientations  at  each  of  2  scales,  and  the  su¬ 
perimposed  grid  was  4x4  pixels.  Quantization  seems  to  remove  variations  due  to 
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uniform  changes  of  intensity.  The  grid  cells  whose  value  of  H  was  lowest  generally 
correspond  to  ‘important’  parts  of  the  image,  eg,  wingtips,  fuselage-wing  junctions, 
and  engine  nacelles. 

In  the  training  phase,  as  list  is  made  of  the  characteristic  features  in  the  images. 
Then,  the  niunber  of  each  type  of  characteristic  feattire  in  each  image  is  counted. 
This  count,  after  normalization,  is  the  input  to  the  PNN;  the  output  is  a  score  for 
each  category. 

Here  is  a  sample  training  session  from  the  SPARCstation;  the  user’s  input  is 
in  italics.  The  aircraft  planforms  were  those  of  the  Douglas  DC-8  transport,  the 
F-15  and  F/A-18  fighters,  and  the  P-3  Antisubmarine  Warfare  aircraft.  These 
were  chosen  because  there  is  some  similarity  between  the  various  types,  yet  each  is 
distinct. 

In  the  program  ‘info’,  the  user  is  given  the  option  of  examining  the  character¬ 
istic  features  in  an  image  constructed  as  follows:  the  intensity  in  each  grid  square  is 
inversely  proportion^d  to  the  uncertainty  in  that  square,  so  that  the  most  important 
features  are  the  brightest. 


[JSW]  gabor 

Program  to  compute  the  Gabor  transform  of  an  image. 
Image  size  is  32  by  32. 

Where  is  the  input  file?  f  15. pic 
. . ,  looking  for  file  flS.pic 

Where  should  I  write  the  coefficients?  f  15. gab 

Maximum  modiilus  is  384.925415 

[JSW]  info 

Program  to  interpret  gabor  transforms  of  images. 

Where  is  the  transfrom  file?  f  15.  gab 
. . .  looking  for  file  fl5.gab 


Number  of  quantization  levels?  8 
Where  should  I  write  the  entropy?  fl5.eni 
Number  of  grid  squares  to  display?  5 
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Write  transformed  images  to  files  (y/n)?  n 

Where  is  the  original  image?  f  15. pic 
. . .  looking  for  file  flS.pic 

/*  3  others  processed  the  same  way  */ 

[JSW]  feature 

Featmre  extraction  program.  This  program 
examines  the  *.ent  files  for  the  training  set 
and  creates  a  training  file  for  pnn.c. 

2  scales  and  4  orientations. 

Number  of  features  from  each  transform?  5 
Name  of  the  training  set  planes 
Include  training  narrative  in  output  (y/n)?  y 
Image  to  process?  dc8 

Another  image  (y/n)?  y 
Image  to  process?  fl5 

Another  image  (y/n)?  y 
Image  to  process?  fl8 

Another  image  (y/n)?  y 
Image  to  process?  pS 

Another  image  (y/n)?  n 
Training  file  contains  24  exemplars. 


To  identify  an  unknown  image,  its  Gabor  transform  is  calculated  and  the  im¬ 
age’s  characteristic  features  are  determined.  These  are  compared  with  the  known 
characteristic  features.  The  (normalized)  feattnre  counts  Eire  the  input  to  the  Prob¬ 
ability  Neural  Network. 

With  the  first  training  set,  the  following  unknown  images  were  successfully 
identified:  the  left  half  of  an  F-15,  the  shifted  top  (nose)  portion  of  an  F/A-18,  a 
shifted  DC-8,  various  images  subjected  to  slight  in-plane  rotations,  and  various  im¬ 
ages  with  additive  white  Gaussian  noise  of  fairly  high  power  (SNR  approximately  10 
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dB).  The  system  recognized  slightly  rotated  images  even  though  the  normalization 
for  rotation-invariance  was  not  implemented. 

Another  trianing  set  consisted  of  three  32  x  32  pixel  digitized  images  of  humcin 
faces  with  16  scales  of  gray.  The  system  recognized  the  faces  despite  various  occlu¬ 
sions  and  noise,  although  more  levels  of  quantization  were  needed  to  be  successful. 
The  system  was  also  nm  on  64  pixel  images  of  human  faces  with  256  shades  of  gray, 
thiis  demonstrating  its  scalability. 

[Note:  the  implementation  of  the  Gabor  transform  and  of  the  entropy  calcula¬ 
tions  was  performed  during  the  term  of  this  grant,  as  written  in  the  original  budget. 
Also,  much  experimentation  was  done  with  the  Probability  Neurad  Network  algo¬ 
rithm.  The  algorithm  for  feature  selection  and  the  final  choice  of  a  Probability 
Neural  Network  were  done  under  the  auspices  •  J  a  ^JES  Summer  Faculty  Research 
grant  at  Rome  Air  Development  Center,  Grifiis  ..  FB,  NY.  Following  the  SFRP, 
more  work  was  done  on  the  theoretical  extensions  to  '  system  which  zire  described 
in  the  following  sections  of  this  report.] 

VII.  Another  Approach 

The  work  above  has  suggested  a  new  approach  to  Invzuriant  Pattern  Recogni¬ 
tion.  This  approach  is  based  on  sophisticated  notions  from  the  mathematical  fields 
of  Functional  Analysis  and  Representation  Theory.  These  ideeis  will  be  described 
below. 

The  natural  setting  for  questions  of  invariance  is  the  theory  of  groups.  A 
group  can  be  defined  abstractly,  but  can  b«-  bes.  thought  of  in  this  context  as  a 
set  of  transformations  which  is  closed  imdei  composition.  For  example,  the  set  of 
rotations  of  the  ordinary  (x,y)-plane  about  the  origin  forms  a  group:  two  such 
rotations  can  be  composed  to  yield  a  rotation  (whose  value  is  the  sum  of  the  angles 
through  which  the  component  rotations  move  points).  There  is  an  identity  rotation, 
namely,  rotation  through  0".  Each  rotation  hM  an  inverse,  namely,  the  inverse  of 
the  rotation  through  is  the  rotation  through  — The  set  of  trzmsformations 
acting  on  the  images  we  would  like  to  identify  —  namely  rotations,  translations, 
amd  scalings  —  also  forms  a  group. 

We  say  a  group  acts  on  a  set  if  it  transforms  elements  of  the  set  to  elements  of 
the  set.  If  all  elements  of  the  group  G  which  acts  on  a  set  X  are  eillowed  to  act  on 
some  element  x,  the  set  obtained  is  called  the  orbit  of  i  and  denoted  Gx. 
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For  example,  imagine  that  G  is  the  group  of  rotations  of  the  plane  about  the 
origin  (usually  denoted  0(2)).  Then  the  orbit  of  some  point  (xo»yo)  is  the  circle  of 
radius  r  =  y^. 

Now,  suppose  we  have  a  group  G  acting  on  a  set  X  and  a  set  of  patterns 
{Ii, ...  Ir}  in  X.  To  identify  an  unknown  pattern  in  a  way  which  is  invariant  under 
the  action  of  0,  we  propose  to  precompute  all  of  the  orbits  {Gfy}.  Then  the  task 
of  identification  of  an  unknown:  pattern  U  becomes  the  taslr  of  finding  the  orbit 
closest  to  [/.  This  procedure  is  automatically  robust  with  respect  to  noise  because 
the  noisy  version  of  If  is  probably  close  to  U. 

For  example,  suppose  that  we  had  pattmis  represented  as  points  in  the  (x,  y)- 
plane,  say  (xi ,  yi ), . . . ,  (x^,  y^),  and  we  would  like  to  recognize  these  patterns  even 
though  they  may  have  been  transformed  by  the  action  of  the  rotation  group  0(2). 
In  this  case,  we  can  precompute  the  orbits  by  computing  their  radii:  =  y/xj  +  yf. 
To  identify  an  unknown  pattern  (x,  y),  we  compute  its  radius  r  =  y/x^  +  y^,  and 
determine  which  rj  is  closest  to  r.  This  tells  us  which  (xj,  yj)  was  rotated  to  obtain 
(i,y).  If  r  =  2  and  (xi,yi)  =  (1,1)  while  (x2,y2)  =  (2,-4),  we  conclude  that  the 
pattern  (—3,4)  is  a  rotated  noisy  version  of  (2,  —4)  since  5  is  closer  to  them  it 
is  to  \/2.  This  is  impressive  when  one  considers  that  (—3, 4)  is  much  closer  to  (1, 1) 
than  it  is  to  (2,  —4)  (5  versus  >/§9). 

In  image  recognition,  it  is  probably  not  the  case  that  we  want  to  let  X  be 
the  plane  and  then  consider  the  orbits  of  the  images  we  plem  to  remember.  More 
likely,  we  can  achieve  significant  compression  (and  subsequent  computationsil  im¬ 
provement)  bv  considering  some  repiesentation  of  the  group  in  question,  A  repre¬ 
sentation  ot  ci  group  of  degree  n  is  an  n  x  n  matrix  associated  to  each  group  element 
in  such  a  way  that  the  group  composition  becomes  matrix  multiplication.  It  is  a 
fundamental  fact  that  every  group  has  many  representations  of  various  degrees.  For 
details  consult  [Serrej. 

One  way  to  approach  the  use  of  group  representations  for  Invariant  Pattern 
Recognition  is  through  the  selection  of  featme  vectors.  Suppose  that  each  image 
Ij  has  an  i — dimensional  feature  vector  Vj  associated  to  it,  and  suppose  that  we 
have  a  degree  r  representation  of  the  relevant  group.  Then  the  orbit  of  Ij  can 
be  computed  by  performing  repeated  matrix  multiplication  of  Vj  by  the  matrices 
representing  the  group.  If  we  have  a  good  ‘sampling  theorem’  (which  would  tell  us 
how  many  samples  of  the  image  we  need  for  accurate  reconstruction),  we  could  in 


fact  consider  the  orbits  to  be  a  finite  set  of  points  in  Each  such  poini  could  be 
assigned  to  a  (virtual)  processing  element  of  a  suitable  parallel  computer,  ie,  each 
processing  element  would  remember  the  name  of  the  image  in  whose  orbit  it  lies. 
Recognition  then  consists  of  finding  the  feature  vector  for  the  unknown  image,  then 
spreading  out  along  r  —  1-dimensional  h)rperspheres  until  an  orbit  is  encoimtered. 

The  amazing  thing  about  such  a  system  is  that  the  speed  of  identification 
improves  as  more  patterns  are  learned.  This  is  because  the  average  distance  from 
any  point  to  one  of  the  patterns  must  go  down  as  the  density  of  ‘known’  points 
increases. 


Figure  1 

Implementation  of  such  a  strategy  depends  on  an  appropriate  and  eiisy-to- 
compute  scheme  for  determining  feature  vectors.  A  strong  candidate  for  such  a 
scheme  is  the  theory  of  wavelets  (see  (MJ).  A  wavelet  scheme  is  an  orthonormal 
basis  for  Z/^(R^),  which  means  that  it  is  easy  to  express  a  given  image  in  terms  of 
this  basis.  According  to  [H]  there  are  appropriate  sampling  theorems,  so  a  finite¬ 
dimensional  feature  vector  is  possible. 

It  Hicy  be  objected  that  a  wavelet  scheme  is  too  abstract,  but  in  fact  the 
wavelets  have  strong  intuitive  meanings  similar  to  Gabor  coefficients.  Thus,  a 
wavelet  can  be  used  to  perform  some  of  the  low-level  feature  detection  employed  in 
the  system  discussed  above  as  well  as  in  the  neocognitron. 

In  implementing  such  a  scheme  it  is  important  to  consider  irreducible  repre- 
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sentations.  Every  representation  of  a  group  G  can  be  decomposed  into  a  ‘direct 
sum’  of  irreducible  representations.  Operationally  this  means  that  there  is  a  basis 
for  R’’  such  that  the  matrices  for  each  group  element  decompose  into  block  form. 
This  has  computational  consequences:  if  a  representation  of  degree  m  +  n  can  be 
decomposed  into  representations  of  size  m  and  n,  then  the  number  of  matrix  entries 
needed  drops  trom  +  2mn  +  to  m*  -H  n*,  a  saving  of  2mn. 

The  problem  is  that  the  some  of  the  groups  which  arise  in  this  context  are 
not  ‘nice’,  so  much  theoretical  work  must  be  done  to  determine  irreducible  sub¬ 
representations.  This  leads  to  an  additional  problem:  write  computer  programs  to 
determine  irreducible  representations.  Much  combinatorial  machinery  from  Repre¬ 
sentation  Theory  is  available  to  help  with  this  task. 

Another  consideration  is  invariant  theory.  In  our  example  of  the  rotation  group 
0(2)  acting  on  the  plane,  we  determined  that  each  orbit  was  uniquely  determined 
by  its  radius.  Invariant  theory  tries  to  determine  such  ‘invariants’  for  other  group 
actions.  In  particular,  the  invariant  theory  of  the  group  of  perspective  transforma¬ 
tions  is  well-understood,  and  there  have  been  some  efforts  to  apply  results  from 
this  area  to  image  recognition. 

All  of  these  ideas  should  be  vigorously  pursued  on  an  appropriate  parallel 
computer. 

VII.  Conclusions 

In  this  grant  effort,  the  use  of  Neural  Network  techniques  in  Invariant  Pattern 
Recognition  has  been  investigated.  The  neocognitron  was  considered,  and  a  system 
based  on  Gabor  transform  preprocessing.  Information  Theoretic  feature  selection, 
and  the  Probability  Neural  Network  was  demonstrated  to  give  very  good  perfor¬ 
mance.  The  success  of  this  effort  leads  to  ideas  to  use  wavelets  and  representation 
theory  to  design  a  parallel  pattern  recognition  system  which  would  be  very  fast 
(with  speed  increasing  when  more  patterns  are  learned),  invariant,  and  robust  with 
respect  to  noise. 
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Abstract 

Infrared  detector  performance  can  be  changed  when  exposure  to  normal 
blackbody  radiation  is  replaced  by  monochromatic  radiation.  The  possi¬ 
ble  difference  in  detectivity  occurs  because  the  photon  noise  limit  may  be 
either  raised  or  lowered.  If  the  detector  is  operating  near  the  BLIP  limit, 
then  its  performance  may  be  degraded  if  that  limit  is  lowered  significantly. 
Alternatively,  if  the  BLIP  limit  is  raised,  then  no  significant  effect  on  the  de¬ 
tector’s  performance  is  anticipated.  The  radiation  wavelength,  detector  cut¬ 
off  wavelength,  detector  quantum  efficiency,  system  bandwidth,  and  incident 
background  photon  flux  from  the  lab  combine  to  determine  the  magnitude 
and  direction  of  the  change.  Such  effects  are  in  theory  easily  quantifiable 
and  probably  will  be  significant  for  the  development  of  a  simulated  mission 
testing  capability  which  uses  laser  radiation. 
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1  Introduction 


Arnold  Engineering  Development  Center  in  Tullahoma,  Tennessee  performs 
tests  on  infrared  detectors  that  are  intended  for  use  in  a  wide  range  of  mis¬ 
sions.  Currently,  scientists  at  Arnold  Center  are  developing  a  revolutionary 
system  for  testing  IR  detectors  under  various  simulated  mission  environ¬ 
ments.  Their  direct  write  scene  generation  (DWSG)  system  uses  a  laser 
beam  to  “paint”  the  anticipated  scene  onto  the  pixel  elements  of  the  detec¬ 
tor  array. 

When  doing  testing  under  simulated  conditions,  the  validity  of  the  sim¬ 
ulation  must  always  be  carefully  considered.  In  the  case  of  the  DWSG.  one 
area  of  concern  is  the  ramifications  of  using  a  monochromatic  laser  to  portray 
the  multiple  wavelengths  of  radiation  inherent  in  virtually  any  realistic  scene. 
.4n  effort  has  been  made  to  address  this  issue  from  a  theoretical  standpoint. 

The  theory,  as  discussed  in  the  ensuing  sections,  indicates  that  if  the 
detector  is  intended  to  work  at  or  near  the  fundamental  noise  limit  which 
arises  from  the  quantum  nature  of  light,  then  the  simulation  could  easily  be 
affected.  It  is  possible  to  calculate  beforehand  when  these  effects  are  likely 
to  occur  and  to  what  degree  they  will  alter  the  simulation.  The  second  half 
of  this  paper  is  devoted  to  such  calculations. 

Before  the  implications  for  the  DWSG  will  have  full  meaning,  their  source 
must  be  thoroughly  understood.  The  potential  problems  are  deeply  rooted  in 
the  basic  theory  of  the  ultimate  limits  on  photon  detection.  Consequently,  the 
first  half  of  this  paper  is  devoted  to  a  careful  exposition  of  the  relevant  theory. 
.Although  this  material  is  not  original,  it  should  be  a  useful  improvement 
over  what  is  found  in  the  literature.  Meticulous  care  has  been  taken  to  use 
notation  that  will  avoid  the  ambiguity  that  often  plagues  this  subject.  .Also, 
an  error  common  to  at  least  two  standard  references  has  been  identified. 
.And  finally,  the  present  analysis  should  be  complete  enough  to  be  etisily 
understood,  yet  it  empheisizes  those  facets  that  are  of  particular  importance 
for  the  DWSG  system. 


2  Basic  Theory 

This  section  consists  mainly  of  material  derived  from  standard  references[l- 
6].  It  will  provide  the  necessary  foundation  for  the  calculations  which  have 
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been  performed  for  che  DWSG  system  and  are  discussed  in  Part  3. 

2.1  Photon  Limited  Detection 

In  this  paper,  we  will  only  be  considering  ideal  detectors.  This  restriction 
allows  us  to  focus  our  attention  on  effects  which  originate  in  the  illuminating 
radiation.  The  ideal  detector  has  no  internal  noise;  no  amplifier  noise,  no 
thermal  noise,  and  no  noise  due  to  defects  inherent  in  the  detector  material 
itself.  Only  the  fundamental  noise  which  is  caused  by  the  discrete  nature 
of  photons  remains.  Because  the  individual  photons  will  arrive  at  random 
times,  the  signal  generated  by  the  detector  will  likewise  fluctuate  randomly. 
If  this  is  indeed  the  only  noise  present,  then  the  detection  is  said  to  be  photon 
limited. 

Before  continuing,  let  us  define  some  symbols.  The  incident  photon  flux  is 
represented  by  <f)‘'{X)  which  has  the  units  of  photons  per  second.  The  symibol 
q  indicates  the  flux  is  in  terms  of  quanta  of  light  per  second  rather  than 
power.  The  functional  dependence  on  A  will  always  be  used  as  an  explicit 
declaration  of  monochromatic  radiation.  In  this  case,  all  photons  incident  on 
the  detector  have  the  same  wavelength  A. 

The  number  of  photoelectrons  generated  per  second  in  the  detector  will 
be  given  by  where  q  is  the  quantum  efficiency^  of  the  detector.  Thus, 

if  e  is  the  electronic  charge,  the  average  total  current  generated  is: 

i  =  (1) 

The  infrared  radiation  incident  on  the  detector  consists  of  discrete  quanta. 
Therefore,  these  photons  will  arrive  at  some  average  rate  with  a  Poisson 
distribution  about  the  average.  It  can  be  shown  (Ref.  1,  p.  132  and  Ref.  3, 
p.  13)  that  the  rms  of  the  deviation  Ai  is  given  by  the  formula: 

=  v/2elA/.  (2) 

This  formula  represents  what  is  often  called  shot  noise  and  was  originally 
derived  by  Schottky^  to  describe  the  noise  inherent  in  the  current  pcissing 

'We  will  make  the  common  assumption  that  q  is  independent  of  A  below  the  cutoff 
wavelength  of  the  detector. 

■W.  Schottky,  Ann.  Phys.  (Leipzig)  57,  541  (1918). 


through  a  temperature-limited  vacuum  diode.  W^e  have  chosen  A/  to  rep¬ 
resent  the  electrical  equivalent  bandwidth  of  the  system.  The  shot  noise 
formula  gives  us  the  noise  generated  by  the  discrete  nature  of  the  photons 
striking  the  detector.  This  fundamental  noise  will  always  be  present,  even 
for  an  ideal  detector  on  which  only  signal  photons  are  incident. 

2.2  Signal  Photon  Limited  Detection 

Consider  a  perfect  detector  when  there  is  no  background  photon  flux.  The 
signal  current  is  given  by  Eq.  (1).  We  will  add  the  subscript  s  to  ^''(A) 
in  order  to  indicate  that  it  is  the  signal  photon  flux  0’(A).  Because  there  is 
no  other  noise  source,  all  noise  is  due  to  the  random  arrival  rate  of  signal 
photons.  The  total  rms  noise  current  is  given  by  Eq.  (2)  when  7  is  i,. 
Dividing  the  two  equations  yields  the  signal- to-noise  ratio: 


Because  the  spectral  noise  equivalent  power  NEP{X)  is  the  power  flux 
<?!>f  (A)  when  S/N  =  1,  we  need  to  set  the  above  equation  equal  to  unity  and 
solve  for  the  photon  flux:^ 

2Af 

m)  =  (4) 

V 

Then  we  convert  to  power  flux  by  using  the  energy  he/ A  of  a  single  photon: 

NEP{X)  =  (5) 

This  equation  for  NEP{\)  represents  the  best  value  possible  for  a  detec¬ 
tor.  Of  course  in  practice,  a  situation  where  no  background  photons  exist  is 
seldom  achieved.'* 

^The  term  “spectral”  and  the  dependence  on  A  is  used  to  indicate  explicitly  that  the 
signal  providing  the  NEP  is  monochromatic. 

''In  fact,  the  observant  reader  will  recognize  that  the  linear  dependence  of  N EP(X)  on 
A/  given  by  Eq.  (5)  is  not  the  usual  relationship  that  is  assumed  for  the  calculation  of 
D-(A). 
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2.3  Background  Photon  Limited  Detection  or  BLIP 

If,  in  addition  to  signal  photons,  background  photons  are  allowed  to  fall 
on  a  detector,  then  more  photoelectrons  are  generated  than  before.  These 
photoelectrons  contain  no  more  signal  information  so  they  only  contribute 
to  the  noise: 

U  =  eTi(j>l{T).  (6) 

But,  if  this  background  generated  “noise  current”  4  were  truly  a  DC  current, 
it  would  not  be  a  noise  at  all  because  it  would  simply  be  a  constant  offset  to 
the  signal  current.  Unfortunately,  there  is  a  Poisson  distribution  around  the 
average  value  tj,  exactly  like  the  fluctuation  around  ij,.  Thus, 


and  substituting  Eq.  (6), 

=  e/2AMl{T).  (8) 

Note  that  in  the  above  equations  we  have  indicated  the  background  flux 
depends  on  T.  Clearly,  photons  generated  from  background  sources  are  not 
monochromatic  but  have  a  broad  range  of  wavelengths  that  are  given  by  the 
Planck  distribution  for  blackbodies.  If  the  entire  background  was  of  one  tem¬ 
perature,  then  it  would  be  T.  Otherwise,  the  situation  is  more  complicated 
and  the  argument  T  should  merely  be  taken  to  indicate  nonmonochromatic 
flux.  Note  also  the  implicit  assumption  that  4>1{T)  includes  photons  only  up 
to  the  detector’s  cutoff  wavelength. 

When  the  background  flux  generates  noise  which  dominates  the  system, 
then  the  detector  is  said  to  be  operating  under  BLIP  conditions.®  Occasion¬ 
ally  the  term  BLIP  is  used  in  a  way  that  implies  a  more  general  meaning, 
such  as  “the  best  performance  possible.”  Technically,  this  is  incorrect.  Under 
rare  conditions,  a  detector  can  perform  better  than  BLIP  if  the  noise  due  to 
the  signal  photons  is  greater  than  the  noise  due  to  the  background.  Since  we 
are  currently  dealing  with  BLIP  conditions, 


^Although  the  extict  identification  of  the  acronym  varies  from  one  author  to  another, 
perhaps  the  most  common  is  Background  Limited  Infrared  Performance. 
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and 


0l{T)  »  4>1{X).  (10) 

Under  BLIP  conditions,  because  of  Eq.  (9),  the  equation  for  total  noise 
current  does  not  need  to  include  the  noise  associated  with  fluctuations  in 
signal  current.  Thus,  Eq.  (8)  gives  the  total  noise  current: 


i„  =  e^2A/#?(T). 
Now  the  signal-to-noise  ratio  becomes 


S/N  = 


i± 


Setting  S/N  =  \  and  solving  for  the  required  signal  flux  yields 


(11) 


(12) 


m)  = 


(13) 


Converting  from  photon  flux  to  power  flux  enables  us  to  find  NEP{\)\ 


This  expression  for  NEP{X)  under  BLIP  conditions  agrees  with  that  ob¬ 
tained  by  Dereniak  and  Crowe  (Ref.  2,  p.  47)  and  Kruse  et  al.  (Ref.  5,  p. 
358).®  However,  Kingston  (Ref.  3,  p.  16)  and  Boyd  (Ref.  1,  p.  135)  find  a  dif¬ 
ferent  dependence  on  wavelength.  Either  Kingston  and  Boyd  are  misleading 
in  their  notation,  or  they  have  errored  by  blurring  the  distinction  between 
monochromatic  photon  flux  and  the  blackbody  spectrum  typically  exhibited 
by  the  background.  Let  us  be  perfectly  clear:  A  in  Eq.  (14)  is  the  wavelength 
of  the  monochromatic  signal  photons  and  has  no  relation  to  the  background 
photon  flux  <f>l{T).  This  point  will  prove  to  be  very  important  later  on  when 
we  do  calculations  for  the  DWSG. 

The  spectral  detectivity  D'{X)  is  simply  the  reciprocal  of  NEP{X)  mul¬ 
tiplied  by  the  square  root  of  the  detector  area  A  and  the  square  root  of  the 

®The  expression  in  Ref.  5  utilizes  Bose- Einstein  statistics.  This  correction  is  usually 
ignored  at  wavelengths  shorter  than  20  pm. 
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Here  the  incident  photon  flux  per  unit  area  (or  “flux  density”)  is  simply 

m)iA- 

The  above  equation  forms  the  theoretical  basis  for  the  graph  of  D'{\) 
versus  A  that  has  been  immortalized  by  The  Infrared  Wall  Chart  produced 
by  SBRC  and  reproduced  in  Fig.  1.  When  the  cutoff  wavelength  of  the  detec¬ 
tor  Ac  is  used  in  place  of  A,  then  Eq.  (16)  delineates  the  line  that  represents 
the  best  performance  possible  for  an  ideal  detector^  under  BLIP  conditions. 
According  to  the  first  term,  the  spectral  detectivity  increases  in  proportion 
to  Ac,  however,  because  of  the  second  term,  it  has  a  more  complicated  depen¬ 
dence  on  Ac  that  comes  through  the  background  photon  flux.  Recall  that  the 
total  background  flux  4>1{T)  is  integrated  up  to  Ac-  If  the  background  is  300 
K,  then  the  number  of  background  photons  received  increases  dramatically 
with  cutoff  wavelength  below  10  Thus,  the  detectivity  decreases.  But, 
because  of  the  shape  of  the  Planck  distribution,  very  few  additional  back¬ 
ground  photons  are  acquired  cis  the  cutoff  wavelength  is  increeised  beyond 
10  /im,  and  so  the  detectivity  increases. 

If,  on  the  other  hand,  the  cutoff  wavelength  of  the  detector  is  kept  con¬ 
stant,  then  <pI{T)  is  also  constant.  In  this  case,  A  represents  the  wavelength 
of  the  monochromatic  signal,  and  according  to  Eq.  (16),  D'[\)  is  directly 
porportional  to  A.  Once  the  signal  wavelength  is  greater  than  the  detector’s 
cutoff  wavelength,  then  obviously  the  detectivity  plummets  because  the  sig¬ 
nal  cannot  be  seen.  This  accounts  for  the  general  sawtooth  shape  of  the 


'Only  photovoltaic  detectors  can  in  theory  achieve  the  performance  indicated  by  Eq. 
(16).  For  photoconduciors,  D’{X)  must  be  less  by  a  factor  of  l/\/2,  os  indicated  in  Fig. 
1.  This  is  because  photoconductors  suffer  from  generation -recombination  noise.  Not  only 
is  there  a  shot  noise  associated  with  the  generation  of  pho>  electrons,  but  there  is  also  a 
shot-like  noise  caused  by  the  recombination  of  the  photoelectrons.  These  two  independent 
noise  processes  in  effect  cause  the  2  in  Eq.  (7)  to  become  a  4  (cf.  Ref  1,  pp.  165-169  or 
Ref  6,  p.  62). 
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experimental  curves  for  the  various  detectors  that  appear  on  The  Infrared 
Wall  Chart. 

For  the  sake  of  completeness,  it  should  be  pointed  out  that  the  spectral 
detectivity  which  is  obtained  for  a  monochromatic  signal  source,  may 

be  converted  to  the  blackbody  detectivity®  D’{T).  The  latter  is  observed 
when  the  signal  is  provided  by  a  blackbody  source  of  temperature  T.  The 
appropriate  conversion  equations  and  graphs  may  be  found  in  the  literature[2, 
4,  5,  and  7|. 

3  Implications  for  the  DWSG  System 

The  basic  theory  that  was  developed  in  Part  2  will  be  used  to  make  cal¬ 
culations  specifically  for  the  DWSG  system.  As  the  results  are  obtained,  a 
discussion  of  their  significance  will  be  given. 


3.1  What  are  the  Issues? 

The  previous  sections  consisted  of  a  careful  consideration  of  the  theoretical 
limits  for  infrared  detectors.  In  this  analysis,  we  found  equations  which  give 
the  spectral  noise  equivalant  power  and  the  spectral  detectivity  under  BLIP 
conditions.  The  equations  show  how  these  figures  of  merit  depend  on  the 
wavelength  of  monochromatic  signal  radiation. 

When  calculating  BLIP  D*{\)  or  BLIP  NEP{X),  it  is  assumed  that  the 
wavelength  of  the  monochromatic  signal  radiation  is  the  same  as  the  cutoff 
wavelength  Xc  of  the  detector.  This  assumption  is  made  simply  because 
the  ideal  detector  will  have  maximum  response  at  the  cutoff  wavelength.® 
However,  when  the  DWSG  is  used  to  test  a  detector,  it  is  exceedingly  unlikely 
that  the  monochromatic  laser  radiation  will  just  happen  to  be  near  the  cutoff 
wavelength.  Consequently,  the  figures  of  merit  for  BLIP  observed  under  these 
special  test  conditions  cannot  have  the  same  values  as  the  usual  figure  of 

®Apparently,  spectral  detectivity  is  the  detectivity  that  is  normally  reported,  although 
this  is  not  always  explicitly  stated.  The  usual  notation  is  DJ,  but  the  subscript  is  some¬ 
times  left  out. 

®In  practice,  Xc  is  usually  defined  as  the  wavelength  at  half  the  peak  responsivity  (i.e., 
detectivity),  but  for  most  real  detectors  the  difference  between  and  the  peak  wavelength 
is  quite  small. 
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merits.  Another  big  issue  is  the  source  of  background  photons.  The  previous 
sections  assume  a  blackbody  distribution  when  calculating  the  figure  of  merit, 
but  as  we  shall  see  this  will  not  always  be  appropriate  in  the  DWSG.  This 
difference  will  also  change  the  values  for  the  figures  of  merit  at  BLIP. 

Will  the  detectors  be  in  BLIP?  Well,  when  testing  IR  devices,  it  usually 
is  desirable  to  keep  the  background  flux  very  low.  The  goal  is  to  allow  the 
testing  to  reveal  the  quality  of  the  detector — that  is,  the  device  noise.  If  this 
is  the  case,  then  the  detector  will  not  be  working  under  BLIP  conditions. 
But,  the  DWSG  is  designed  to  test  the  detector  under  simulated  mission 
conditions.  This  circumstance  is  very  different  than  normal  laboratory  test¬ 
ing.  If  the  designer  of  the  IR  detection  system  was  able  to  minimize  system 
and  device  noise,  then  it  would  seem  likely  that  the  detector  is  working  very 
close  to  BLIP  Certainly  this  would  be  the  goal  of  the  system  engineer  who 
wishes  to  maximize  performance. 

The  danger  here  is  that  when  simulating  the  mission  environment  with 
a  monochromatic  laser  beam  significantly  different  from  A,;,  the  noise  (as 
indicated  by  the  figures  of  merit)  will  be  artificially  raised  because  of  a  change 
in  the  BLIP  limit.  There  seems  to  be  three  different  situations  that  can  arise: 

•  The  background  flux  from  the  lab  is  sufficient  to  put  the  detector  into 
BLIP. 

•  The  laser  beam  is  used  to  “paint”  a  background  according  to  mission 
expectations  and  the  background  flux  from  the  room  is  negligible. 

•  The  laser  beam  is  used  to  “paint”  a  background  according  to  mission 
expectations  and  the  background  flux  from  the  room  is  nonnegligible. 

Each  of  the  following  sections  will  be  devoted  to  one  of  these  cases. 

3.2  Laboratory  Background  Flux  Causes  BLIP  Con¬ 
ditions 

This  case  is  the  simplest  to  deal  with  and  perhaps  the  most  unrealistic.  The 
background  flux  is  essentially  a  blackbody  distribution  as  before,  and  we 
take  the  laser  photons  to  contain  nothing  but  signal  information.  Now  the 
monochromatic  signal  flux  used  in  all  the  equations  from  the  theory  sections 
is  provided  by  the  laser.  The  big  difference,  however,  is  that  this  laser  is 
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operating  at  a  wavelength  different  than  the  cutoff  wavelength  that  is  used 
in  calculating  the  best  possible  performance  of  the  detector.  What  we  must 
do  is  find  equations  that  will  show  how  the  performance  under  DWSG  testing 
relates  to  the  normal  figures  of  merit. 

We  assume  that  the  lab  background  flux  which  is  large  enough  to  put 
the  detector  into  BLIP,  is  also  significantly  greater  than  the  laser  signal  flux. 
Thus  the  conditions  of  Eqs.  (9-10)  are  met.  Under  these  conditions,  the 
spectral  noise  equivalent  power  that  will  be  observed  is  given  by  Eq.  (14); 

he 

NEP{X,)  =  - 
A/ 

Here  we  have  put  the  subscript  /  on  the  wavelength  to  denote  the  signal  as 
being  generated  by  the  laser.  The  normal  situation,  where  the  wavelength  is 
Ac,  is  simply  given  by: 

NEP(X,}  =  j 

The  relationship  between  these  two  figures  of  merit  is  easily  seen  to  be: 

NEP{K)  =  ^NEP{\,).  (19) 

By  using  the  definition  of  spectral  detectivity,  given  in  Eq.  (15),  it  is  easy  to 
put  the  above  relationship  in  terms  of  D*{X): 

D-(\,)  =  (20) 

What  do  the  last  two  equations  mean?  According  to  Eq.  (19),  if  A//Ac  < 
1,  then  the  leiser  will  have  to  provide  more  power  at  A/  than  that  required 
at  Ac  to  get  the  same  signal-to-noise  ratio.  By  the  same  token,  if  Aj/Ac  >  1, 
then  the  laser  will  need  to  provide  less  power.*® 

If  the  laser  wavelength  is  less  than  the  cutoff  wavelength,  then  the  detector 
will  behave  as  if  its  detectivity  has  been  reduced  by  a  factor  of  A;/ Ac.  Another 

*°It  is  my  understanding  that  occasionally  the  laser  radiation  from  the  DWSG  is  at  a 
higher  wavelength  than  the  intended  cutoff  wavelength  of  the  detector.  This  is  accom¬ 
plished  by  altering  the  temperature  of  the  detector  so  it  will  be  sensitive  at  the  higher 
wavelength. 
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way  of  thinking  about  this  correction  is  to  consider  the  effect  it  has  on  Fig. 
1.  The  line  that  represents  the  performance  of  an  ideal  photovoltaic  detector 
operating  at  BLIP  will  be  scaled  down  (see  Fig.  2)  by  the  same  factor  Xt/\c- 
In  other  words,  when  testing  with  the  DVVSG  the  detector  will  have  a  different 
ideal  limit. 

In  practice,  detectors  do  not  quite  achieve  the  BLIP  limit.  Thus,  if  the 
limit  is  raised  under  testing  it  will  have  no  effect.  Lowering  the  limit  will 
necessitate  a  correction  if  the  test  condition  BLIP  limit  becomes  lower  than 
the  detector’s  limit.  The  correction  will  not  be  quite  as  simple  as  Eq.  (20) 
indicates  because  it  is  necessary  to  know  the  detector’s  actual  limit  under 
normal  conditions  beforehand. 

3.3  Entire  Incident  Photon  Flux  Generated  by  the 
Laser 

When  using  the  DWSG  to  test  an  IR  system  in  a  mission  environment, 
the  laser  will  be  used  to  “paint”  the  expected  scene.  Obviously,  most  of 
this  scene  will  consist  of  what  would  normally  be  called  background  and, 
as  discussed  in  Section  3.1,  should  put  the  detector  near  BLIP.  Because 
the  background  is  intentionally  painted  by  the  laser,  this  condition  should  be 
termed  “simulated  BLIP.”  '~'he  background  photons  responsible  for  simulated 
BLIP  are  not  merely  the  normal  photons  with  a  blackbody-like  distribution, 
but  they  include  the  monochromatic  Iciser- generated  photons. 

We  will  now  consider  the  case  where  virtually  all  photons  incident  on  the 
detector  are  generated  by  the  laser,  and  the  flux  from  the  lab  is  negligible. 
Such  a  situation  is  exactly  the  opposite  of  that  expressed  by  Eqs.  (9-10). 
Now  we  have 

oUT)  «  (21) 

Note  that  all  the  photons  generated  by  the  laser  are  legitimately  considered 
part  of  the  signal  flux.  After  all,  the  detector  has  no  way  of  distinguishing  be¬ 
tween  those  laser  photons  that  represent  the  target  and  those  that  represent 
the  background. 

We  are  really  working  in  the  signal  photon  detection  limit  described  in 
Section  2.2.  Under  these  unusual  circumstances,  Eq.  (5)  gives  the  value  for 


the  spectral  noise  equivalent  power  that  would  be  observed: 

NEP{\i)  =  (22) 

The  normal  situation  has  a  spectral  noise  equivalent  power  given  by  Eq.  (18). 
A  little  algebra  will  show  that  these  two  equations  are  related  in  the  following 
way: 

NEP(K)  =  (23) 

Once  again,  Eq.  (15)  allows  us  to  easily  convert  to  spectral  detectivity: 


D’ih) 


K 


nPliT) 

2  A/ 


D’iXc). 


(24) 


The  interpretation  of  these  two  relationships  is  only  slightly  more  difficult 
than  in  the  last  section.  The  ratio  Aj/A^  plays  the  same  role.  But,  due  to 
the  fundamental  difference  between  the  monchromatic  laser  photons  and  the 
usual  blackbody  background  radiation,  there  is  an  additional,  complicated 
correction  term.  This  term  depends  on  the  quantum  efficiency  of  the  detec¬ 
tor,  the  bandpass  of  the  IR  system,  and  the  stray  blackbody  flux  from  the 
lab.  The  dependence  on  t]  and  A/  is  inconvenient  because  these  variables 
will  change  for  different  detectors  and  systems.  The  dependence  on  <f>l{T)^ 
however,  is  unlikely  to  cause  any  difficulties  betfause  it  should  be  a  constant 
depending  only  on  the  DWSG. 

In  order  to  understand  the  implications  of  Eqs.  (23-24)  more  fully,  let  us 
simpUfy  things  by  considering  a  very  bland  scene.  This  simplification  allows 
us  to  make  a  very  rough  order  of  magnitude  calculation  to  get  some  feel 
for  the  minimum  size  of  the  new  correction  term.  Assume  a  small  quantum 
efficiency  of  0.2.  Assume  the  background  flux  density  from  the  lab  is  the 
10'°  photons/cm^/sec  measured  for  the  baseline  DWSG.  A  typical  detector 
element  has  an  area  of  4  x  10“®  cm^.  Then,  we  find  the  flux  is  </>’(T’)  =  4  x  10® 
photons/sec.  An  estimate  of  bandwidth  is  more  difficult  for  this  writer.  It 
has  been  shown[8]  that  in  theory  the  fastest  possible  response  time  of  a 
typical  HgCdTe  detector  is  ~  0.2/xsec.  This  speed  sets  an  upper  limit  on 
the  bandwidth  of  ~  1  MHz."  Plugging  these  numbers  into  the  radical  of 

^‘For  a  system  with  an  exponential  decay  time  r,  the  equivalent  electrical  bandwidth 
A/  is  l/4r  (cf.  Ref.  1,  p.  115). 
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Eq.  (24)  should  provide  the  smallest  estimate  of  the  minimum  value  the  new 
correction  term  can  have: 


(25) 


If  A//Ac  is  slightly  less  than  one,  as  will  usually  be  the  case,  then  the 
correction  term  will  be  almost  an  order  of  magnitude  less  than  unity.  At 
first  glance  this  would  indicate  a  significant  reduction  of  the  BLIP  limit  un¬ 
der  DWSG  testing.  Thus,  detectors  would  appear  to  have  a  detectivity  that 
is  artificially  reduced  even  further  than  the  reduction  indicated  under  the 
conditions  of  the  last  section.  However,  the  assumptions  that  went  into  the 
order  of  magnitude  calculation  of  Eq.  (25)  were  chosen  to  yield  the  extreme 
minimum  correction  factor.  It  is  unlikely  that  this  term  would  ever  be  so 
small.  Although  it  i:i  very  possible  that  the  lab  background  flux  may  even¬ 
tually  be  reduced  below  the  baseline  levels,  this  change  will,  hopefully,  still 
be  swamped  by  the  over-estimate  of  the  bandwidth. 

The  author  believes  that  the  correction  term  will  always  be  greater  than 
unity.  Consequently,  under  these  conditions  the  BLIP  limit  will  always  be 
raised  during  testing  with  the  DWSG.  This  result  may  seem  surprising  at 
first,  but  on  closer  inspection  it  is  not  unreasonable.  A  monochromatic  source 
is  providing  the  photons  that  under  mission  conditions  would  be  generated  by 
objects  that  emit  a  wide  range  of  wavelengths.  It  seems  physically  reasonable 
to  expect  a  simple  “black  and  white”  signal  to  have  inherently  less  noise  than 
a  “multicolor”  signal.  If  this  is  true,  and  the  IR  detector  is  limited  only  by 
BLIP,  then  the  detector  would  output  a  cleaner  signal  under  simulated  tests 
than  it  will  on  the  actual  mission. 

A  higher  BLIP  limit,  however,  will  not  be  a  difficulty  for  the  DWSG. 
Real  detectors  do  not  quite  achieve  the  BLIP  limit.  Consequently,  artificially 
reducing  the  fundamental  noise  due  to  the  quantum  nature  of  light  shcuid  not 
enhance  the  detector’s  perfomance.  The  detector  is  dominated  by  .oise  from 
other  sources  and  will  continue  to  exhibit  essentially  the  same  detectivity 
even  as  the  BLIP  limit  is  raised. 
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3.4  Incident  Flux  Generated  by  Both  Lab  Background 
and  Laser 


When  the  background  flux  seen  by  the  detector  is  generated  both  by  the  lab 
and  the  laser,  then  the  situation  is  a  combination  of  the  cases  discussed  in 
the  last  tv.o  sections.  Not  surprisingly,  things  are  now  a  bit  more  complex. 
Neither  Eq.  (21)  nor  Eq.  (10)  is  valid.  The  background  flux  from  the  lab, 
and  the  laser  generated  signal  photon  flux,  must  be  considered  comparable: 

OUT)  ~  .^’(A;).  (26) 

The  following  calculation  will  proceed  in  much  the  same  way  as  the  one 
in  Section  2.3.  As  before,  the  signal  current  is  simply 


is  =  eriOliXi). 


(27) 


But  now,  the  noise  current  which  was  given  by  Eq.  (11)  has  an  additional 
term.  Thus,  _ 

=  e\/2A/,W(r)  +  .A!(A,)|.  (28) 

The  signal-to-noise  ratio  is 


S/N^ 


_ #?(A/) _ 


(29) 


To  find  noise  equivalent  power,  we  need  to  set  the  signal-to-noise  ratio  equal 
to  one.  Then  Eq.  (29)  can  be  solved  for  the  signal  flux.  Multiplying  the 
signal  flux  by  the  energy  of  a  single  photon  gives  the  desired  result: 


NEP{Xi)  = 


hcAf 

Xri 


1  + 


2ri 


'1  +  Xf'^liT) 


(30) 


Once  again,  we  must  compare  this  equation  with  the  normal  situation  given 
by  Eq.  (18).  After  some  algebra. 


NEP(X,)  = 


\/m) 

Va/ 

[l  +  yi  +  |1«?(T)J 

NEP{X,). 


(31) 


Although  complicated,  this  result  seems  reasonable.  If  the  lab  background 
flux  is  low  enough  so  that  (pHT)  «  Af/'Irj,  then  Eq.  (31)  easily  reduces 
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to  the  same  equation  obtained  in  the  last  section  for  the  all  laser-photons 
case.  Such  a  reduction  is  expected  because  all  laser-photons  implies  a  low 
lab  background  flux.  On  the  other  hand,  if  lab  background  flux  dominates 
so  that  <pI{T)  »  A//277,  then  Eq.  (31)  will  reduce  to  Eq.  (19).  This  result 
was  obtained  in  Section  3.2  when  all  the  background  photons  originated  from 
the  stray  blackbody  radiation  in  the  lab. 

This  third  case  is  probably  the  most  realistic  of  the  three  considered.  The 
first  two  situations  are  merely  the  limits  of  Eq.  (31)  at  high  <pl{T)  and  low 
4>1{T)  compared  to  I'lrj.  In  actual  practice  it  would  seem  likely  that  the 
true  value  will  not  be  at  either  extreme.  Although  the  lab  background  flux 
will  probably  be  relatively  const.int  in  the  DWSG,  the  correction  term  will 
be  different  from  one  detector  system  to  another  if  either  the  bandpass  or 
quantum  efficiency  vary. 


4  Conclusions  and  Recommendations 

In  this  paper  we  have  carefully  considered  whether  the  performance  of  an 
IR  detector  could  change  when  it  is  illuminated  by  monochromatic  radiation 
instead  of  the  usual  blackbody  distribution.  Such  an  analysis  is  important 
for  understanding  the  effectiveness  of  the  simulation  testing  which  will  be 
performed  with  the  DWSG.  It  seems  fair  to  conclude  that  the  simulation 
may  be  affected  because  of  changes  in  the  fundamental  detectivity  limit. 

Three  cases  have  been  identified.  The  first  case  occurs  when  all  the  back¬ 
ground  of  the  simulated  scene  is  generated  by  stray  radiation  from  the  lab. 
Under  this  circumstance  the  BLIP  limit  will  probably  be  reduced  enough 
that  the  detector’s  performance  is  degraded.  In  the  second  case,  the  back¬ 
ground  flux  from  the  lab  is  negligible.  Then  the  BLIP  limit  is  raised  so  the 
performance  of  the  detector  should  be  unaffected.  The  first  two  cases  are 
really  the  extremes  of  the  more  general  third  case.  Consequently,  depend¬ 
ing  on  the  relative  size  of  the  lab  background  flux,  the  detector’s  measured 
performance  may  or  may  not  be  degraded. 

At  this  point,  there  are  several  paths  left  to  follow: 

1.  The  third  case  should  be  carefully  examined  with  numerical  values 
appropriate  for  relevant  Ilv  detectors.  We  need  to  determine  which 
parameter  is  the  critical  one  and  if  the  BLIP  limit  is  likely  to  be  raised 
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or  lowered.  Probably,  a  complete  understanding  of  the  bandpass  A/ 
will  be  crucial. 

2.  If  a  detector  is  very  close  to  the  ideal  limit,  we  need  to  confirm  that  its 
performance  will  not  improve  when  that  limit  is  raised. 

3.  In  this  paper  it  was  assumed  that  the  quantum  efficiency  is  a  constant 
when  in  practice  it  may  vary  by  a  factor  of  two.  Will  this  wavelength 
dependence  have  a  significant  effect  on  the  calculations  reported  here? 

4.  A  measurement  of  the  detectivity  of  a  detector  should  be  attempted  in 
the  DWSG  and  compared  with  the  value  found  under  standard  condi¬ 
tions.  Such  an  experiment  should  prove  to  be  an  important  check  on 
the  validity  of  the  present  analysis. 

5.  Some  consideration  of  the  noise  characteristics  of  a  laser  should  be 
made.  The  assumption  inherent  in  this  paper  is  that  the  laser  photon 
flux  obeys  a  Poisson  distribution.  How  valid  is  this  assumption? 

Hopefully,  this  paper  has  identified  and  correctly  analyzed  one  of  the  ma¬ 
jor  issues  in  the  development  of  the  DWSG.  It  will  take,  however,  a  significant 
amount  of  effort  to  apply  the  results  to  the  actual  data  obtained  with  the 
DWSG. 
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Figure  1:  A  reproduction  of  a  portion  of  the  SBRC  Infrared  Wall  Chart 
which  plots  spectral  detectiviy  as  a  function  of  wavelength. 
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Figure  2;  This  variation  of  the  SBRC  Infrared  Wall  Chart  shows  the  BLIP 
limit  scaled  downward  by  a  factor  of  yjXc  =  0.7.  The  lower  dashed  line  is 
the  BLIP  limit  for  a  photovotaic  detector  being  tested  in  the  DWSG  system 
under  the  conditions  described  in  Section  3.2.  The  upper  dashed  line  is  the 
same  as  in  Figure  1. 
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Abstract 

Measurements  of  absorption  line  broadening  and  shift  have  been  performed 
for  sodium  vapor  with  nitrogen  as  a  perturber  gas.  In  these  studies,  the  sodium 
cell  was  maintained  at  temperatures  of  200-350°C,  while  N2  buffer  gas 
pressures  ranged  from  10-600  Torr.  Collapse  of  the  hyperfine  doublet  of  the  Na 
ground  state  into  a  single  peak  was  investigated  as  a  function  of  probe  laser 
intensity  and  buffer  gas  pressure.  These  studies  are  preliminary  experiments  in 
an  effort  to  develop  a  Resonant  Doppler  Velocimetry  technique  for  hypersonic 
flows  at  Arnold  Engineering  Development  Center  (AEDC). 
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Introduction: 

Resonant  Doppler  Velocimetry  (RDV)  is  a  non-intrusive  diagnostic  technique 
which  uses  laser-induced  fluorescence  from  atoms  or  molecules  in  a  flowing 
gas  to  measure  the  velocity,  temperature,  and  pressure  of  the  gas.  RDV  was 
proposed  by  R.B.  Miles  in  1975  (1),  and  first  demonstrated  in  1977  (2).  The  gas 
flow  field  is  probed  with  a  tunable  dye  laser  directed  so  that  it  has  a  component 
in  the  direction  of  the  flow.  Since  the  atoms  are  moving,  they  exhibit  a  Doppler 
shifted  peak  absorption  frequency.  The  fluorescence  resulting  from  this  shifted 
absorption  may  be  compared  to  that  of  a  stationary  system,  allowing  the 
determination  of  the  component  of  velocity  in  the  direction  of  the  beam. 

Sodium  atoms  are  often  used  as  the  fluorescing  species  in  RDV 
experiments  (3).  Sodium  atoms  have  two  strong  absorption  line  manifolds  in 
the  visible  region.  The  wavelength  of  these  lines  is  very  near  the  peak  output  of 
an  argon-ion  pumped  Rhodamine  6G  dye  laser.  Also,  sodium  is  found  to  occur 
naturally  in  some  high  temperature  exhaust  systems,  eliminating  the  need  to 
seed  the  flow  via  external  means  in  experiments  of  this  type. 

The  work  described  here  is  part  of  an  effort  to  monitor  and  record  sodium 
fluorescence  features  under  a  variety  of  temperatures,  pressures,  and  perturber 
gas  conditions.  Analysis  of  the  data  will  lead  to  a  characterization  and 
recognition  of  sodium  fluorescence  features  for  prospective  test  environments 
involving  hypersonic  flows. 

Objective: 

Sodium  is  well-known  for  its  strong  visible  fluorescence  features. 
Fluorescence  studies  of  sodium  to  determine  number  densities,  line  broadening 


parameters,  and  line  shift  parameters  are  well  documented  (4,  5,  6).  However, 
in  order  to  effectively  implement  the  RDV  method,  it  is  crucial  that  the  nature  of 
the  fluorescence  profile  in  a  stationary  system  be  understood.  With  this  goal  in 
mind,  studies  have  been  initiated  to  observe  the  position  and  shape  of  sodium 
fluorescence  features  as  a  function  of  perturber  gas  constituent,  temperature, 
and  pressure. 

Apparatus: 

The  excitation  and  detection  system  for  observing  sodium  fluorescence  is 
shown  in  schematic  form  in  Figure  1.  A  Spectra-Physics  Model  171  Argon-ion 
laser  was  used  to  pump  a  Coherent  699-21  ring  dye  laser.  Using  Rhodamine 
6G  dye,  the  ring  laser  was  able  to  deliver  1W  of  single  frequency  radiation 
peaking  at  approximately  590  nm.  The  ring  dye  laser  is  capable  of  scanning  a 
30  GHz  frequency  range  is  0.25  sec.  A  Fabry-Perot  interferometer  monitored 
the  laser  line  to  check  single  mode  frequency  stability.  The  laser  was  directed 
into  a  temperature  controlled  fluorescence  cell  (Comstock  heat  pipe)  which 
could  be  evacuated  to  the  millitorr  range.  Transmitted  intensity  was  monitored 
with  a  silicon  photodiode,  and  fluorescence  was  detected  at  90°  to  the  beam 
direction  with  an  RCA  8644  photomultiplier  tube  with  S-20  response.  The 
detector  output  was  collected  by  a  LeCroy  9400  digital  oscilloscope,  triggered 
synchronously  with  the  scanning  electronics  of  the  ring  laser.  Fluorescence 
plots  were  recorded  on  the  scope  face  and  dumped  to  an  HP7475A  plotter. 
Experiment: 

Sodium  vapor  in  a  heated  cell  was  excited  by  a  continuous  wave  ring  dye 
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laser,  tuned  4-30  GHz  across  the  sodium  resonance  line.  This  resonance 
line  is  at  a  wavelength  589.6  nm,  corre  onding  to  the  32Si/2  to 
transition  in  sodium  (Figure  2).  Absorption  and  fluorescence  profiles  were 
monitored  for  sodium  cell  temperatures  ranging  from  200-350OC.  This 
corresponds  to  sodium  number  densities  in  the  range  approximately  tO''^  - 
10^5  atoms/cm^.  These  profiles  were  recorded  in  order  to  measure  the 
resonance  lineshift  and  line  broadening  present  for  different  pressures  of 
perturber  gas.  In  all  experiments,  the  perturber  gas  introduced  into  the  sodium 
cell  was  nitrogen. 

McCartan  and  Farr  have  shown  the  broadening  and  shift  of  the  sodium  D 
lines  to  depend  linearly  on  perturber  gas  pressure  up  to  about  0.8  atm  (7). 
These  studies  were  conducted  using  Ar  rather  than  N2  as  the  perturber  gas. 
Jongerius  et.  al.  (5)  have  measured  the  collisional  broadening  and  shift  of  the 
Na-D  lines  in  a  vapor  cell  at  approximately  460K  containing  Ar,  N2  and  H2  as 
perturber  gases.  According  to  their  results,  in  a  vapor  cell  with  N2  at  460K  the 
broadening  coefficient  is  2.74  x  10~20  cm'Vatom/cm^.  Ideally,  the  number 
density  is  related  to  temperature  and  pressure  by  the  following: 


19 

n  =  2.7x  10 


'atm  T (  K) 


) 


(1) 
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So  that  at  460K  wa  expect  a  broadening  of 


(2.74  X  10  cm  (2.3  x  10^®  •  P(Torr))  (2) 

or  18.7  MHz  per  Torr  change  in  perturber  pressure. 

Experiments  performed  by  AEDC  personnel  with  N2  perturber  pressures 
ranging  from  10-600  Torr  were  performed  in  a  Na  vapor  cell  at  526K.  These 

studies  were  found  to  yield  a  broadening  parameter  of  19.9  ±  9  MHz  per  Torr 
pressure  change. 

Jongerius  also  reports  an  observed  pressure  shift  coefficient  for  N2  perturber 
gas  as  0.82  x  10“20  cm'Vatom/cm^,  This  shift  is  toward  lower  frequencies,  and 
would  correspond  to  a  frequency  shift  of  5.55  MHz  per  Torr  pressure  change. 
Miles  (1 )  has  put  forth  a  pressure-temperature  relationship  for  frequency  shift, 
which  for  N2  can  be  expressed  as 

Af  =  -6.76  Ghz  •  P(atm)  •  SQRT(300K/r)  (3) 

At  460K,  this  results  is  Af  =  7.2  MHz  per  Torr  change  in  perturber  pressure. 

AEDC  experiments  at  526K  yield  red  shifted  frequencies  for  Na  in  N2  of 
approximately  7.7  MHz  per  Torr  change  in  perturber  pressure. 

The  set  point  temperature  and  pressure  of  the  sodium  vapor  cell  used  in  the 
AEDC  experiments  proved  difficult  to  maintain,  thus  the  data  obtained  were  not 
as  repeatable  as  desired.  It  was  noted  that  in  many  trials,  the  maximum 
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observed  absorption  of  the  excitation  beam  could  reach  40%.  This  allowed  the 
possibility  of  significant  lineshape  distortion  due  to  radiation  trapping.  It  was 
observed  that  for  any  given  pressure,  it  was  possible  to  see  changes  in  the 
fluorescence  lineshape  by  altering  the  intensity  of  the  probe  beam.  This 
phenomenon,  discussed  by  Walkup  et.  al  (8),  was  investigated  for  its  probable 
importance  in  accurately  determining  the  peak  frequency  of  absorption  and 
fluorescence  in  RDV  experiments. 

Sodium  has  a  ground  state  hyperfine  splitting  of  1.77  GHz  (Figure  3)  which 
can  easily  be  resolved  by  the  instruments  used  in  this  investigation.  The 
absorption  'ineshape  for  sodium  vapor  is  expected  to  show  the  superposition  of 
two  Voigt  profiles  separated  by  1.77  GHz  and  weighted  in  intensity  by  the 
degeneracy  factors  of  the  hyperfine  levels,  5  to  3.  Walkup  observed  the 
collapse  of  this  hyperfine  doublet  into  a  single  peak  due  to  laser-induced  optical 
pumping  at  laser  intensities  far  below  radiative  saturation  levels.  Conditions 
were  calculated  which  showed  this  lineshape  distortion  to  occur  when  the 
product  of  laser  power  and  buffer  gas  pressure  exceeded  0.1  mW  Torr. 

Walkup’s  work  was  conducted  with  10  Torr  of  Xenon  as  the  perturber  gas. 

This  altering  of  lineshape  was  also  observed  in  AEDC  experiments.  In  order 
to  predict  pressure  and  intensity  regimes  where  lineshape  distortion  might 
occur,  calculations  were  performed  using  the  method  of  Walkup  (8).  The 
absorption  rate  per  unit  volume  is  expressed  as 
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[D  +  (ai2  +  «2i)'^2]  "2  °^2  Ip  <“12  “21)^1] 


(4) 


A  = 


+  a^2 


+  a2iR2 


where  the  R  factors  represent  the  stimulated  absorption  rates  and  are  Voigt 
profiles. 


Fl,  =  .5yf(I/l3)  jdv 


/Vj 

e _ I _ 

Jn  +  (o)  -  kv  -  (Oj)^ 


(5) 


Here,  the  spontaneous  decay  rate  Ys  =  6.3  x  10^/sec:  the  saturation  intensity 

Ig  =  12mW/cm2;  propagation  constant  k  =  2n/k',  the  Lorentzian  HWHM 

T  =  0.5(Ys  +  bp),  where  b  is  the  pressure  broadening  coefficient,  and 

=  (2kT/m)‘'^2  In  equation  4,  the  a’s  are  branching  ratios  and  are  chosen  to  be 

statistical  {ai2  =  -625  and  a2i  =  .375),  as  are  the  equilibrium  ground  state 

populations  n.|0,  n2°.  Values  for  diffusion  rate  D  and  coefficient  of  pressure 
broadening  b  were  taken  from  sources  by  given  Walkup. 

Figures  4-8  show  the  results  of  these  calculations.  In  Figure  4,  the 
overlapping  Voigt  profiles  are  denoted  by  asterisks  while  the  solid  line  shows 
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the  resulting  absorption  lineshape.  In  figures  5-8,  the  pressure  is  held  constant 
while  the  laser  intensity  is  increased,  and  the  resulting  lineshape  profile  is 
shown.  Note  that  when  the  pressure  intensity  product  reaches  -0.1  mW  Torr, 
there  is  only  one  maximum  evident,  and  this  peak  becomes  stationary  after  the 
intensity  reaches  0.01  mW.  The  fact  that  this  peak  location  shifts  to  a  final 
frequency  alleviates  fear  that  the  absorption  profile  is  in  continuous  motion  and 
hence  can’t  be  used  as  a  reference  in  RDV  experiments.  In  fact,  as  the  peak 
narrows  with  increased  laser  intensity,  peak  locations  become  more  easily 
measurable.  These  calculations  were  repeated  for  pressures  of  10,  50  and  200 
Torr  with  similar  results.  It  should  be  noted  that  increasing  laser  intensity  can 
result  in  observation  of  Lamb  dips  in  absoiption  profiles,  which  show  up  as  dips 
or  depressions  at  the  peak  of  the  absorption  curve. 

Conclusions: 

More  experiments  are  necessary  for  a  complete  understanding  of  sodium 
absorption  and  fluorescence  in  the  presence  of  buffer  gases  at  AEDC.  Efforts 
should  be  made  to  increase  the  pressure  and  temperature  stability  of  the  test 
environment,  and  experiments  should  be  carried  out  at  low  probe  laser 
intensities  to  observe  absorption  lineshapes  and  shifts  without  saturation  effects 
or  optical  pumping. 
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Ficure  1.  Schematic  of  Experimental  Equipment 


t 


« 


4 


4 


4 


Figure  2.  Energy  levels  of  Sodium  atom  (3), 
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Figure  3.  Hyperfine  splitting  of  energy 
levels  in  the  Na-D  lines  (4) . 
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Figure  4.  Fluorescence  intensity  for  the  Na-Dj  lino  (solid  curve).  Asterisks 
represent  the  two  Voigt  profiles  which  make  up  the  solid  curve, 
pressure  is  100  Torr,  laser  intensity  is  lE-6  mW. 
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Figure  5.  Fluorescence  intensity  for  the  Na-D^  line.  N2  pressure  is  100  Torr, 
laser  intensity  is  IE- 4  mW. 
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laser  intensity  is  IE- 3  mW, 
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Figure  7.  Fluorescence  intensity  for  the  Na-D^  line.  N2  pressure  is  100  Torr, 
laser  intensity  is  IE- 2  iriV. 
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laser  intensity  is  lE-1  mW. 
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ABSTRACT 

An  experimental  study  on  crossflow  jet  mixers  was  undertaken 
to  evaluate  the  effect  of  various  similarity  parameters  on  mixing 
results.  The  tests  were  carried  out  in  three  mixers  of  similar 
shape  but  different  sizes.  Two  common  fluids,  air  and  water,  were 
used  as  the  test  mediums.  In  the  air  mixer,  heated  air  jets  were 
injected  and  mixed  with  a  primary  flow  of  air  at  ambient 
temperature;  and  in  the  water  mixer,  jets  of  salt  brine  were  mixed 
with  a  primary  flow  of  fresh  water.  The  scalar  fields  (temperature 
or  concentration)  in  the  mixing  chambers  under  various  operating 
conditions  were  measured  and  the  measured  data  afforded  an 
assessment  on  the  relative  importance  of  the  similarity  parameters 
on  mixing.  Based  on  such  results,  several  parameters  pertinent  to 
mixing  operation  were  identified.  It  appears  that  the  mass 
transfer  process  in  a  water  mixer  does  not  duplicate  the  heat 
transfer  process  in  an  air  mixer.  Thus  the  validity  of  using  mass 
transfer  process  in  water  mixers  to  simulate  the  heat  transfer 
process  in  air  mixers  is  opened  to  guestion. 
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NOMENCLATURE 

a  constant  for  jet  trajectory 

b  constant  for  jet  trajectory 

c  time  average  concentration  at  a  spatial  point 
c.  concentration  of  salt  brine  before  mixing 

JO 

c  concentration  of  salt  in  primary  fluid  before  mixing 
0 

d  diameter  of  the  mixing  chamber 

D  diameter  of  the  mixing  chamber 

intensity  of  segregation 
j  momentum  flux  ratio 

L  length  of  the  far  field 

Lj  length  of  segregation 

Pr  Prandtl  number 

Rej  Reynolds  number  for  the  jet  fluid 
Re^  Reynolds  number  for  the  primary  fluid 
Ri  Richardson  number 

S  pitch  between  injection  ports 

Sc  Schmidt  number 

St  Stanton  number 

t  time 

t^  time  constant  of  mixing 

t^  resident  time 

T  temperature 

ini-  1  temperature  of  the  jet  fluid 

T  initial  temperature  of  the  primary  fluid 

0 

Vj  jet  velocity 


15^ 


V  velocity  of  the  primary  fluid 
X  axial  distance  from  the  plane  of  injection 
Xg  length  of  the  near  field 

z  vertical  distance  from  the  injection  port. 

Greek  Alphabet 

E  dissipation  rate  of  turbulent  energy 
0  dimensionless  temperature 

u  kinematic  viscosity 

pj  density  of  jet  fluid 

po  d  .nsity  of  primary  fluid 

a  standard  deviation  of  concentration 

T  dimensionless  time 

$  dimensionless  concentration 
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I .  INTRODUCTION 

The  idea  of  attaching  a  jet  mixer  to  an  arc  heater  was 
conceived  at  the  Arnold  Engineering  Development  Center  (AEDC)  to 
upgrade  a  facility  for  ground  simulation  of  hypersonic  flow  in  the 
re-entry  flight  of  space  vehicles.  In  the  mixer,  the  jets  are 
issued  in  a  direction  normal  to  the  primary  flow.  The  development 
of  such  a  mixer  was  perceived  as  primarily  an  empirical  task 
because  a  reliable  analytical  method  for  the  mixing  process  has  not 
been  fully  developed.  On  the  other  hand,  the  environment  of 
extremely  high  temperature  (more  than  5000  K)  and  pressure  (more 
than  150  atm)  in  the  prototype  mixer  is  very  difficult  and  costly 
to  duplicate  in  a  pilot  test  facility;  it  is  desirable  to  simulate 
the  actual  process  with  common  fluids  such  as  air  or  water  at 
moderate  temperature  and  pressure.  In  particular,  AEDC  has 
available  a  water  tunnel  test  facility  and  it  is  desirable  to 
perform  test  with  that  facility.  The  cost  of  the  testing  can  be 
cut  down  even  further  if  the  heat  transfer  process  can  be  modeled 
by  a  mass  transfer  process  where  salt  brine  is  injected  and  mixed 
with  the  primary  flow  of  fresh  water  in  a  water  mixer.  The 
validity  of  such  an  approach  needs  to  be  examined  on  the  basis  of 
similarity  law  for  the  process  in  jet  mixers.  A  complete  set  of 
similarity  parameters  for  a  jet  mixer,  to  the  author's  knowledge, 
has  not  been  published  in  the  literature.  Previous  works  were 
often  confined  to  two  types  of  studies,  namely,  (1)  the  study  of 
jet  trajectory  for  either  a  thermal  plume  (Wright,  1977)  or  an 
isothermal  jet  (Moussa  et  al.,  1977);  and  (2),  the  study  of  mixing 
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in  a  pipe  mixer  (Forney  et  al.,  1979). 

In  the  summer  of  1989,  the  author  participated  in  the  USAF-UES 
Summer  Faculty  Research  Program  and  his  task  was  to  identify  the 
similarity  parameters  for  the  mixing  process  to  facilitate  the 
preparation  of  an  experimental  program  for  the  design  of  the  mixer. 
As  the  result  of  that  work,  a  set  of  dimensionless  similarity 
parameters  was  developed  (Lee,  1989)  and  this  analysis  was  further 
updated  in  a  later  work  to  include  the  effect  of  heat  transfer 
process  between  the  jet  stream  and  the  primary  fluid.  This  set 
includes  a  total  of  nine  dimensionless  parameters  of  which  two  are 
related  to  the  geometry  of  the  mixer  and  the  other  seven  are 
derived  from  equations  governing  the  mixing  process.  The  research 
work  also  found  that  it  was  not  feasible  to  satisfy  all  the 
conditions  imposed  by  the  seven  parameters  unless  the  two  fluids 
being  mixed  in  the  pilot  mixer  are  exactly  the  same  as  those  fluids 
in  the  prototype  mixer.  However,  such  a  stringent  requirement  can 
be  relaxed  if  some  of  the  seven  parameters  are  found  to  be  of 
secondary  importance  to  the  mixing  process.  The  relative 
importance  of  the  seven  parameters  can  be  found  only  through 
experiments. 

This  work  is  a  continuation  of  the  research  started  in  the 
summer  of  1989.  The  goals  are  to  rank  the  relative  importance  of 
the  seven  parameters  through  an  experimental  program  and  to  check 
whether  the  process  in  a  water  mixer  duplicates  that  in  a  gas 
mixer.  To  this  end  three  mixers  of  similar  shape  but  different 
sizes  were  designed  and  built  as  the  testing  facility.  Of  the 
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three  mixers,  two  were  water  mixers  where  salt  brine  was  injected 
into  a  primary  stream  of  fresh  water  and  the  resulting 
concentration  field  was  measured;  the  third  was  a  gas  mixer  where 
heated  air  was  injected  to  a  flow  of  ambient  air  and  the  resulting 
temperature  field  was  measured.  By  varying  the  experimental 
conditions  the  effect  of  each  parameter  on  mixing  can  be  isolated 
for  evaluation.  The  results  from  such  experiments,  together  with 
those  from  other  investigators,  afford  an  assessment  of  the  effect 
of  each  parameters.  The  comparison  between  the  concentration  field 
in  the  water  mixers  and  the  temperature  field  of  the  gas  mixer 
illustrates  whether  mixing  in  a  water  mixer  can  simulate  the  mixing 
in  a  gas  mixer. 

This  report  includes  a  brief  review  of  previous  works, 
followed  by  a  description  of  the  test  apparatus  and  experimental 
procedures,  the  presentation  of  experimental  results,  discussion 
and  conclusions. 
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II.  OBJECTIVES 


1.  To  design  and  construct  three  model  mixers  for  evaluating 

similarity  criteria  for  the  mixing  process  in  jet  mixers. 

2 .  To  compare  the  relative  importance  of  the  similarity 
parameters  and  to  identify  the  ones  pertinent  to  the 
mixing  process. 

3.  To  examine  the  validity  of  simulating  the  heat  transfer 
process  in  the  prototype  mixer  with  the  mass  transfer 
process  in  a  water  mixer. 

4.  To  recommend  an  experimental  approach  to  facilitate  the 
design  of  the  prototype  mixer. 

III.  BASIC  THEORY 

According  to  the  analysis  of  Lee  (1989)  ,  the  flow  in  the  mixer 
can  be  divided  into  two  zones,  the  near  field  and  the  far  field. 
The  near  field  is  located  between  the  injector  and  the  point  where 
the  jets  impinge  on  one  another.  In  the  near  field,  the  identity 
of  each  jet  remains  and  the  mixing  is  accomplished  through 
entrainment.  The  rest  of  the  mixer  downstream  of  the  near  field  is 
called  the  far  field  where  all  the  jets  lose  their  identity  and  the 
mixing  is  accomplished  by  turbulent  diffusion.  Corresponding  to 
these  two  mixing  mechanisms,  two  sets  of  similarity  parameters  were 
deduced  by  Lee  (1989)  .  A  summary  of  this  work  is  given  in  what 
follows. 

A.  The  Near  Field;  The  similarity  parameters  were  derived  from 
the  method  of  physical  similitude  (Kline,  1986) .  In  this  method, 
the  dimensionless  parameters  were  deduced  by  taking  the  ratio  of 
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the  individual  terms  of  the  conservation  equations.  Thus  the 
conservation  equations  of  mass,  momentum  and  energy  for  a  jet  in 
cross  flow  were  invoked  and  the  method,  led  to  the  following 
dimensionless  parameters 

T  =  f  f  fid  Y-  St,  Ri,  Re.,  ReJ 

To-  Tj,  \po,V^,  / 

A  more  general  relationship  should  also  include  the  effect  of 

geometrical  factors,  which  consist  of  the  diameter  ratio  between 

the  injector  and  the  mixer,  and  the  pitch  to  diameter  ratio  (a 

parameter  related  to  the  number  of  injection  ports) .  The  inclusion 

of  these  two  parameters  yields 

T  -  T.  =  f/fid  v.  St,  Ri,  Re.,  Re^,  d,  S  ]  (1) 

To-  Tjo  \PO,  V^,  D  D/ 

The  dimensionless  parameters  shown  in  Eq. (1)  are  more  general  than 
those  given  by  Moussa  et  al  (1977)  (whose  study  was  confined  to 
non-buoyant  flow)  and  those  provided  by  Wright  (1977)  and  Forney  et 
al  (1982)  (which  concerned  mainly  with  mixing  in  a  pipe  mixer)  .  In 
fact,  it  can  be  shown  that  the  dimensionless  parameters  deduced 
from  these  investigations  are  only  a  subset  of  those  shown  in 
Eq. (1) . 

As  is  well  known  from  dimensional  analysis,  the  functional 
relationship  in  Eq.(l)  can  only  be  obtained  empirically  and  to 
achieve  such  a  goal  would  require  extensive  testing.  The  goal  of 
this  study,  however,  is  to  identify  the  parameters  pertinent  to  the 
mixing  process  and  to  compare  the  mixing  process  in  air  and  water 
mixers. 
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The  Far  Field. 


B.  The  Far  Field.  In  turbulent  mixing  theory,  the  degree  of 

mixing  is  specified  by  the  intensity  of  segregation  Is  which 

represents  the  difference  between  the  ingredients  being  mixed. 

Corrsin  (1957,  1964)  showed  that  in  isotropic  turbulence  the 

intensity  of  segregation  would  decay  exponentially  with  time,  i.e. 

Is  =  rms  value  of  temperature  fluctuation  at  time  t  =  e'  (2) 

initial  rms  value  of  temperature  fluctuation 

where  t^  is  the  time  constant  of  mixing.  The  value  of  t^.  can  be 

evaluated  from  the  following  equations. 

i,  for  gas  Pr  (or  Sc)  <  1 


tc  =  /5 


(\) 


3-Pr^ 


(3) 


ii,  for  liquid 


Pr  (or  Sc)  ))  1 

In  Prj 


^  ^  (i)‘ 


(4) 


By  normalizing  equation  (2)  ,  Lee  (1989)  obtained  the  following 
relations: 

•tR/tC  T 


Is  =  e- 


(5) 


where  t^  is  the  resident  time  of  the  fluid  in  the  mixer 
tp  =  L/V^  ,  and  r  is  a  dimensionless  time 

Thus  the  dimensionless  parameter  in  the  far  field  is  the  time 
ratio  t^/tg.  As  can  be  seen  from  Eqs.  (3)  and  (4)  ,  the  time 
constant  t^  is  dependent  on  Lj  (the  scale  of  segregation)  which  is 
in  turn  dependent  on  the  result  of  mixing  in  the  near  field.  Thus 
the  degree  of  mixing  in  the  far  field  is  related  to  all  the 
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the  degree  of  mixing  in  the  far  field  is  related  to  all  the 
parameters  specified  in  Eg.  (1),  in  addition  to  the  resident  time 
in  the  mixer. 

Combining  the  dimensionless  parameters  for  the  near  field  and 
the  far  field,  one  obtains 

T  -  =  F/fii  V  St,  Ri,  Re.,  Re^,  d  S  t./t^^  (6) 

Too-  \^po,  V^,  D,  D,  J 

For  mass  transfer  process,  the  temperature  should  be  replaced  by 

concentration  and  St  should  be  the  Stanton  number  for  mass 

transfer. 

IV.  TEST  APPARATUS  AND  EXPERIMENTAL  PROCEDURE 

A.  Test  Apparatus 
1.  The  mixers 

» 

Three  mixers,  two  for  aqueous  flow  and  the  third  for  gaseous 
flow,  were  designed  and  built  at  the  University  of  Wisconsin- 
Platteville.  The  three  mixers  are  similar  in  shape,  but  different 
in  size  (the  diameter  of  the  mixing  chamber  for  the  two  water 
mixers  is  respectively  4  and  6  inches,  and  is  8  ”  for  the  gas 
mixer) .  A  schematic  drawing  of  the  three  mixers  is  shown  in  Fig. 
1.  As  can  be  seen  from  the  drawing,  a  mixer  consists  of  three 
sections,  i.e.  the  head,  the  middle  and  the  end  sections.  The  head 
section  includes  the  inlet  for  the  primary  fluid,  a  diffuser,  a 
flow  straightener,  and  a  settling  chamber.  The  middle  section 
consists  of  a  collar  and  the  mixing  chamber.  Four  injecting 
nozzles  are  mounted  with  even  spacing  around  the  circumference  of 
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the  collar  on  a  plane  normal  to  the  axis  of  the  mixer.  The  four 
nozzles  can  be  either  closed  or  opened  so  that  the  number  of  the 
injection  ports  can  be  varied.  By  changing  the  size  of  the  nozzle 
one  may  also  vary  the  jet  diameters.  The  end  section  is  a 
convergent  flow  passage  connecting  to  the  exit  of  the  mixer.  Holes 
are  drilled  along  the  wall  of  the  mixing  chamber  either  for  taking 
samples  (as  for  water  mixers)  or  for  installing  thermocouple  probes 
or  velocity  probes  (as  for  the  gas  mixer) .  The  holes  are  located 
at  0.375  D,  0.75  D,  1.5  D,  3  D,  4.5  D,  and  6  D  from  the  injector 
plane. 

The  flow  system  for  the  two  water  mixers  is  shown  in  Fig.  2. 
The  primary  fluid  (fresh  water)  was  supplied  from  a  constant  head 
tank  (600  gallons,  63  ft  head  above  ground)  and  a  recirculation 
pump.  The  injected  fluid  (salt  brine)  was  pre-mdxed  in  a  brine 
tank  and  then  delivered  to  the  injection  nozzles  by  a  centrifugal 
pump  via  a  manifold.  The  flow  rates  of  the  primary  fluid  and  the 
injected  fluid  were  measured  respectively  by  a  turbine  flow  meter 
and  four  orifice  meters.  The  photographs  of  the  two  water  mixers 
are  shown  in  Fig.  3a  and  3b. 

The  flow  system  (Fig.  4)  for  the  gas  mixer  consists  of  a 
radial  flow  fan  (5"  H2O  head,  1000  cfm)  which  is  used  to  deliver 
primary  flow  of  air  at  room  temperature  (21*  C)  to  the  mixing 
chamber.  The  injected  air  is  heated  by  an  electric  heater  (1.3  KW) 
and  delivered  to  the  injection  nozzle  by  a  blower  (21"  H2O,  100 
cfm) .  Orifice  meters  are  used  to  measure  the  flow  rates  of  both 
the  primary  and  the  injected  fluids. 


63  Feei;  of  lleacl 


Schematic  of  the  water  flow  system 


Nicrone 

Heater 

Section 


Heated  Pipe 


Jet  In jector 


Air  Tunnel 


Fig.  4a  Air  flow  system  for  the  air  mixer 


Fig.  4b  The  air  mixer  system 
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2 .  Measurement  Techniques 

The  concentration  field  in  the  two  water  mixers  was  obtained 
by  taking  samples  (with  hypodermic  needles)  from  the  sampling 
holes.  The  sampling  location  can  be  varied  radially  by  traversing 
the  tip  of  the  needle,  axially  by  inserting  the  .  eedle  in  various 
sampling  holes  along  the  mixing  chamber,  and  angularly  by  rotating 
the  injection  collar.  The  disturbance  caused  by  the  hypodermic 
needle  was  negligible  because  the  size  of  the  needle  was  very  small 
(1/32"  O.D.)  and  the  rate  of  drawing  was  very  slow  (0.5  cc/sec) . 
The  sample  (approximately  30cc)  was  then  placed  in  a  test  tube  and 
the  concentration  was  measured  by  a  conductivity  meter  (model  CDB- 
70,  Omega  Co)  whose  accuracy  was  0.5%  of  the  concentration  level 
being  measured.  The  calibration  of  the  meter  indicated  linear 
response  over  the  range  of  measurement. 

The  temperature  field  in  the  gas  mixer  was  measured  with 
copper-constantan  thermocouples.  Nine  thermocouples  (AWG  No.  32) 
were  fixed  to  a  rake  over  a  distance  of  4  inches.  The  rake  was 
then  inserted  to  the  mixing  chamber  through  the  holes  on  the  walls. 
The  signal  from  the  thermocouples  was  then  recorded  by  a  computer- 
data-acquisition  system  at  a  sample  rate  of  2000  Hz.  Fifty 
readings  were  taken  in  each  sample  and  the  average  temperatures 
were  calculated  through  a  software  package. 

The  velocity  field  around  the  jet  was  measured  with  a  five- 
hole  direction  probe. 
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B.  Limits  of  the  Measurement  Technique 

The  measurement  technique  mentioned  abcva  can  yield  only  the 
time  average  of  the  scalar  field.  From  the  measurement  the  average 
value  over  a  cross  section  can  be  taken  and  the  deviation  from  the 
average  can  also  be  deduced.  The  uniformity  of  the  scalar  field  is 
an  indicator  of  the  mixedness  in  the  mixer.  However,  the  deviation 
of  the  scalar  field  from  the  average  is  different  from  the  concept 
of  intensity  of  segregation  mentioned  in  £q.  (2)  which  is  the  rms 
value  of  the  fluctuation  measured  at  a  spatial  point  with  respect 
to  time.  Thus  the  method  employed  in  this  experiment  cannot  be 
used  to  verify  the  relations  specified  by  Eg.  (5)  . 

C.  Experimental  Schedule 

With  the  three  mixers  of  different  sizes  operating  with  two 
different  test  mediums,  one  may  examine  the  'effect  of  the 
dimensxonless  parameters  in  Eq.  (6)  on  mixing  by  varying  the 
density  ratio,  the  velocity  ratio,  the  flow  rate,  the  diameter  of 
the  nozzle  and  the  number  of  injection  ports.  Comparison  between 
the  scalar  fields  in  the  air  mixer  and  the  water  mixers  can 
illustrate  whether  test  on  water  mixer  can  simulate  the  test  on  air 
mixer. 

The  experimental  matrix  for  the  air  mixer  and  the  two  water 
mixers  is  listed  in  Tables  1,2,  and  3. 
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Table  1.  Experimental  matrix  for  the  gas  mixer  (Primary  fluid: 


Number 
of  jets 

ambient 

Velocity 

ratio 

air  at  21*C) 

Reynold 

number 

Diameter 

ratio 

Temperature  of 
jet  fluid  ft) 

1 

3.75 

29500 

1/16 

97 

1 

3.75 

88000 

1/16 

96 

1 

5 

88000 

1/16 

97 

1 

5 

29500 

1/16 

90 

1 

1.8 

88000 

1/8 

111 

1 

5 

88000 

1/8 

115 

2 

2 

88000 

1/16 

98 

2 

3.75 

88000 

1/16 

99 

2 

5 

29500 

1/16 

98 

2 

5 

50000 

1/16 

100 

2 

5 

88000 

1/16 

100 

2 

6 

88000 

1/16 

111 

2 

7 

88000 

1/16 

115 

4 

3.75 

88000 

1/16 

86 

4 

5 

88000 

1/16 

84 

4 

5 

50000 

1/16 

88 

4 

5 

29500 

1/16 

86 

4 

7 

88000 

1/16 

• 

82 

Table  2. 

Experimental 

matrix  for  water 

mixer  #1 

(Dia.  = 

4;  primary  fluid:  fresh  water 

from  water 

main) 

Number 

Concentration  of 

of  jets 

v./v 

Re 

d/D 

jet  fluid  Cj(%) 

1  4 
1  4 
1  4 
1  4 
1  5 
4  3 
4  4 
4  5 


12000 

1/16 

7.5 

16000 

1/16 

7.5 

16000 

1/16 

15 

16000 

1/16 

1.8 

16000 

1/16 

7.5 

16000 

1/16 

7.5 

16000 

1/16 

7.5 

16000 

1/16 

7.5 

4 


•i 


I 


4 
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Table  3. 

Experimental 

matrix  for  water 

mixer  #2 

(Dia.  = 

6";  primary  fluid:  fresh  water 

from  water 

main) 

Number 

Concentration  of 

of  jets 

v/v 

Re 

d/D 

jet  fluid  Cj(%) 

1 

3.75 

30000 

1/16 

7.5 

1 

4 

16000 

1/16 

7.5 

1 

5 

30000 

1/16 

7.5 

2 

4 

30000 

1/16 

7.5 

2 

5 

30000 

1/16 

7.5 

D.  Indicator  of  Mixedness. 


In  this  work,  the  degree  of  mixing  (or  mixedness)  is  defined 
as  the  standard  deviation  a  of  the  local  time  average  concentration 
(i.e.,  a  spatial  point  in  a  cross  section)  with  respect  to  the 
average  concentration  over  the  entire  cross  section.  The  scalar 
field  (either  the  temperature  field  or  the  concentration  field)  was 
first  measured  and  the  results  were  converted  to  the  following 
dimensionless  forms: 


a.  temperature  parameter  9  =  T  -  (7) 

-  T„ 

iO  0 

b.  concentration  parameter  i  -  C  -C^  (8) 

C.  -  C 

JO  o 

According  to  Shope  (1989)  the  standard  deviation  a  at  the  exit  of 
the  mixer  should  be  less  than  5%.  The  distance  required  by  the 
fluid  stream  to  attain  such  a  degree  of  mixing  will  be  called  the 
mixing  distance.  Incidentally,  this  is  also  the  minimum  length  of 
the  mixer.  Such  a  length  is  not  measured  in  this  work  because  the 
primary  goal  of  the  project  is  to  examine  whether  it  is  feasible  to 
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use  a  water  mixer  to  similate  the  mixing  in  an  air  mixer. 

The  standard  deviation  a  ,  used  in  this  work  to  denote 
mixedness,  is  different  from  the  intensity  of  segregation  (shown 
in  Eg.  (2))  which  is  used  in  chemical  engineering  practice. 
According  to  Brodkey  (1967),  the  intensity  of  segregation  is 
ratio  of  the  rms  fluctuating  value  to  the  initial  rms  fluctuating 
value.  To  measure  I^,  one  must  record  the  instantaneous  scalar 
quantity  over  a  certain  period  of  time.  The  sampling  method  used 
in  the  water  mixer  could  not  yield  such  measurements.  Thus  the 
method  employed  in  this  work  cannot  be  used  as  a  direct 
verification  of  Eg.  (5)  .  However,  since  both  and  a  decay  along 
the  direction  of  flow  in  the  mixer,  a  measurement  of  a  can  probably 
be  regarded  as  an  indicator  of  I3,  and  thus  indirectly  reflects  the 
effect  of  the  time  ratio  t^/t^  specified  in  Eg.  (5)..  It  must  also 
be  pointed  out  that  the  determination  of  is  rather  difficult  due 
to  the  lack  of  reliable  methods  to  ascertain  the  values  of  and 
€  in  Eqs.  (3)  and  (4) .  (McKelvey  et  al.  1975) . 

V.  RESULTS 

The  effects  of  the  dimensionless  parameters  specified  in  Eg. 
(6)  on  mixing  are  examined  by  examining  their  effects  on  the 
measured  scalar  fields.  Prior  to  the  presentation  of  such  results, 
one  must  first  have  the  knowledge  concerning  the  extent  of  the  near 
field  and  the  far  field,  and  this  information  can  be  obtained  if 
the  velocity  trajectory  of  the  jet  is  known.  The  velocity 
trajectory  measurement  will  thus  be  given  before  the  presentation 


» 


of  the  scalar  field  measurements. 

A.  Velocity  Trajectory  of  the  Jets. 

There  are  several  definitions  with  regards  to  jet 
trajectories.  Here  the  jet  trajectory  is  defined  as  the  locus  of 
the  maximum  velocity  in  the  plane  of  symmetry,  in  a  manner  similar 
to  that  used  by  Kamotani  et  al.  (1972).  The  intersection  point 
between  the  trajectories  of  two  opposed  jets  is  considered  as  the 
center  of  coalescence  and  the  distance  between  the  center  of  the 
injection  port  and  the  point  of  coalescence  is  defined  as  the 
extent  of  the  near  field.  Such  a  definition  of  the  near  field  is 
only  approximate  because  the  impingement  of  the  jets  would  cover  a 
certain  area  instead  of  at  a  point  (due  to  the  finite  dimension  in 
jet  width  and  thickness) . 

The  measurement  was  carried  out  in  the  air*  mixer  and  the 
velocity  vector  field  was  obtained  by  using  a  five-hole  directional 
probe.  The  result  is  shown  in  Fig.  5  along  side  the  trajectories 
predicted  by  the  following  empirical  equation  developed  by  Kamotani 
et  al.  (1972) : 

z/d  =  a  (x/d)^  (9) 

In  Eg.  (9) ,  z  and  x  represent  respectively  the  penetration  and  the 
longitudinal  movement  of  the  jet;  a  and  b  are  empirical  constants. 
The  comparison  shown  in  Fig.  5  demonstrates  that  the  measured 
trajectories  are  in  reasonable  agreement  with  Eg.  (9)  and  further 
measurement  does  not  seem  to  be  necessary.  Although  the  equation 
is  for  predicting  the  trajectory  of  a  single  jet  in  cross  flow,  it 
can  be  used  for  predicting  the  trajectory  of  multi- jet  as  well 
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(Kamotani,  1974)  .  Thus  Eq.  (9)  will  be  used  in  this  work  to 
predict  the  jet  trajectory  and  the  point  of  coalescence,  and  the 
result  is  shown  in  Table  4 . 

Table  4  Location  of  the  point  of  coalescence  at  various 
velocity  ratios 

- Vj/V, - ^ - 

4  10.17 

5  6.2 

6  4.13 

7  2.63 

In  Table  4,  is  the  axial  distance  between  the  center  of  the 

injection  port  and  the  point  of  coalescence.  As  expected,  the 
value  of  Xg  decreases  as  the  velocity  ratio  increases. 

B.  Effect  of  Number  of  Injection  Ports 

Effect  of  number  of  injection  ports  on  mixing  was  examined  in 
both  the  gas  mixer  and  the  aqueous  mixers.  The  presentation  of  the 
result  will  mainly  focus  on  the  experiments  carried  out  in  the  gas 
mixer  because  they  are  more  complete.  The  mean  temperature 
profiles  were  plotted  in  contour  maps  for  the  case  of  a  single  jet, 
two  diametrically  opposed  jets  and  four  diametrically  opposed  jets. 
The  velocity  ratio  in  this  plot  is  equal  to  5. 

The  contour  map  of  a  single  jet  is  shown  in  Fig.  6.  It  shows 
that  the  contour  exhibits  a  characteristic  kidney  shape.  The  high 
temperature  zone  in  the  map  corresponds  to  the  location  of  a  vortex 
which  was  also  observed  by  other  investigators.  As  the  jet 
proceeds  downstream,  the  spread  and  rise  of  the  jet  can  clearly  be 
seen  by  comparing  the  three  maps  in  Fig.  6. 
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For  the  case  of  double  jets,  the  contour  maps  of  the 
temperature  field  are  shown  in  Fig.  7.  At  the  location  of  x/d  =  6 
(near  the  point  of  coalescence  as  shown  in  Table  4) ,  the  presence 
of  the  vortex  is  still  discernible.  After  the  two  opposed  jets 
impinge  on  each  other,  the  two  pairs  of  high  temperature  zone  merge 
into  one  pair  (only  one  vortex  is  shown)  and  spread  sideways  in  a 
direction  normal  to  the  plane  containing  the  jet  center  line. 

Fig.  8  shows  the  contour  maps  for  four  jets  (forming  two  pairs 
of  opposed  jets)  at  various  axial  locations.  The  two  high 
temperature  zones  in  each  jet  disappear,  probably  due  to  mutual 
interference  from  neighboring  jets.  The  sideways  spread  observed 
in  Fig.  6  is  also  hindered  by  the  other  pair  of  jets  and  the  spread 
after  the  impingement  is  observed  only  in  the  direction  of  45°  with 
respect  to  the  plane  containing  the  center  line  of  the  jet. 

The  standard  deviation  of  the  temperature  field  at  two  cross 


sectional  positions  for 

these  three 

cases  is  shown  in  Table 

Table  5  Standard 

deviation  a 

at  two  cross  sections 

a 

fat  x/d=50^ 

a  fat  x/d=100) 

One  jet 

55% 

Two  jets 

36% 

25% 

Four  jets 

24% 

15% 

Table  5  indicates  that  mixedness  increases  with  the  number  of 
injection  ports.  However,  even  with  four  jets  operating,  the 
standard  deviation  is  still  higher  than  the  targeted  5%  at  a 
location  100  jet-diameter  away  from  the  injection  port. 

Tests  being  carried  out  in  water  mixers  show  the  same  tendency 
as  observed  in  the  gas  mixer.  However,  the  spread  and  standard 


deviation  are  markedly  different.  Such  difference  raises  the 
question  about  the  validity  of  simulating  heat  transfer  process 
with  mass  transfer  process  in  water  mixers.  Such  a  point  will  be 
discussed  in  greater  detail  in  Section  VI. 

C.  The  Diameter  Ratio  d/D 

The  effect  of  diameter  ratio  was  examined  in  the  gas  mixer 
only  and  the  purpose  was  to  compare  the  measured  data  with  previous 
investigation.  The  experiment  was  thus  confined  to  a  single  jet 
being  issued  perpendicularly  to  a  cross  flow  and  the  jet  diameter 
was  varied  only  once. 

The  measured  temperature  profile  at  three  locations  on  the 
symmetrical  plane  of  the  jet  is  shown  in  Fig.  9  for  two  different 
diameter  ratios.  As  can  be  seen  from  Fig.  9,  the  jet  with  larger 
diameter  would  penetrate  deeper  into  the  primary  flow  and  the 
temperature  distribution  is  more  uneven.  This  observation  is  in 
agreement  with  that  given  by  Kamotani  et  al.  (1972)  . 

The  jet  diameter  would  thus  affect  mixing  in  two  opposite 
ways.  An  increase  in  jet  diameter  would  increase  the  size  of  one 
constituent  and  thus  the  unevenness.  On  the  other  hand,  it  will 
also  promote  penetration  and  turbulence  in  the  flow  field  which  is 
beneficial  to  mixing.  To  achieve  the  best  result  of  mixing,  a 
compromise  between  these  two  effects  will  have  to  be  made. 
According  to  the  recommendation  of  Simpson  (1975) ,  the  desired 
diameter  ratio  for  a  multi- jet  cross  flow  mixer  should  be  roughly 
equal  to  1/16.  For  a  single-jet  pipe  mixing,  Forney  (1986) 
suggested  that  at  a  given  jet  velocity  the  diameter  of  the  jet 


should  ensure  a  penetration  to  the  center  line  of  the  pipe.  Since 
this  work  is  dealing  with  multi-jet  mixer,  the  recommendation  from 
Simpson  (1975)  is  subsequently  adopted. 

D.  The  Velocity  and  Density  Ratio 

The  effect  of  these  two  ratios  was  examined  rather  thoroughly 
in  the  works  of  Holdeman  et  al.  (1977)  and  Kamotani  et  al.  (1972, 
1974) .  However,  these  works  were  confined  to  multi- jet  mixing  in 
a  rectangular  duct.  To  complement  the  research  work  mentioned 
above,  experiments  were  carried  out  in  this  work  to  obtain  results 
of  mixing  in  circular  ducts.  This  type  of  results,  to  the  author's 
knowledge,  has  not  been  published  previously. 

The  construction  of  the  mixers  used  in  this  study  does  not 
allow  for  significant  variation  of  density,  as  can  be  seen  from 
Tables  1  to  3.  In  fact,  since  the  change  in  density  is  so  small, 
all  the  tests  carried  out  in  either  the  gas  mixer  or  water  mixer 
can  be  regarded  approximately  as  constant.  Thus  the  effect  of 
density  ratio  will  be  assessed  by  experimental  evidence  provided  by 
other  investigators. 

The  works  of  Kamotani  et  al.  (1972,  1974)  and  Holdeman  et  al. 
(1977)  indicated  that  the  density  and  velocity  were  acted  together 
in  a  combined  fashion  through  a  momentum  flux  ratio  J  (=p jVjVpoV^^)  . 
Apart  from  contributing  to  the  momentum  flux  ratio,  they  conclude 
that  the  effect  of  density  ratio  alone  was  secondary  when  the 
density  ratio  was  within  their  experimental  limit  of  2.8.  The  work 
of  Vranos  et  al.  (1988)  also  tended  to  support  the  above  contention 
in  which  the  jet  trajectory  was  found  to  follow  the  following 


equation 

Z  =  0.76 

d  [P^  J  (10) 

According  to  Eq.  (10) ,  a  twenty-fold  increase  in  density  ratio  will 
bring  about  less  than  40%  change  in  jet  penetration.  However,  the 
effect  of  density  ratio  is  still  uncertain  if  the  ratio  exceeds  the 
experimental  limit  mentioned  above  and  further  work  in  this  regard 
is  recommended. 

The  experimental  result?  quoted  in  the  previous  paragraph  are 
certainly  admissible  to  the  result  of  Eq.  (6) .  The  experimental 
results  just  show  that  the  density  ratio  and  the  velocity  ratio  in 
Eq.  (6)  should  act  in  a  combined  fashion  to  form  a  momentum  flux 
ratio  when  the  density  ratio  is  not  larger  than  2.8. 

The  equipment  used  in  this  work  did  not  allow  for  significant 
variation  in  density,  and  within  the  experimental  limit  no 
dependence  on  density  ratio  was  detected.  The  perception  that  the 
two  ratios  should  be  combined  was  thus  adopted  in  the  present 
study.  The  contour  maps  obtained  from  two  different  momentum  flux 
ratios  are  shown  in  Fig.  10. 

As  can  be  seen  in  Fig.  10,  the  jets  with  higher  J  impinge  with 
each  other  at  an  earlier  stage  in  comparison  with  the  jets  with 
lower  J.  The  standard  deviations  of  average  concentration  a  at  x/d 
=  50  are  21%  (for  J  =  49)  and  45  (for  J  =  14) .  Thus  an  increase  in 
velocity  ratio  is  beneficial  to  the  degree  of  mixing.  Whether  the 
mixedness  will  increase  indefinitely  with  J  remains  to  be  verified. 

The  positive  effect  of  r  on  mixing  is  probably  true  for  multi- 
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jet  mixers  only.  In  this  case  the  jets  will  collide  with  one 
another  at  the  central  axis  of  the  mixer,  and  the  turbulent 
intensity  downstream  of  the  point  of  coalescence  increases  with  the 
momentum  of  the  jets.  An  increase  in  turbulence  level  will 
certainly  be  beneficial  to  the  mixing  process.  On  the  other  hand, 
if  the  mixer  has  only  one  jet  being  issued  into  the  primary  flow, 
then  an  increase  in  momentum  flux  ratio  may  not  be  beneficial  to 
the  mixing  result.  (Forney,  1986) . 

E.  Effect  of  Reynolds  Number  and  Richardson  Number. 

As  can  be  seen  from  Tables  1  to  3,  the  Reynolds  number  ranges 
between  12,000  to  30,000  in  the  two  water  mixers,  and  the  variation 
covers  only  the  test  with  single  jet  injection.  On  the  other  hand, 
experiment  on  the  gas  mixer  is  more  complete  and  the  Reynolds 
number  ranges  between  29,500  and  88,000  for  the  one  jet,  two  jets 
and  four  jets  injection.  The  results  being  presented  in  this 
report  will  thus  be  focused  on  the  air  mixer  and  one  typical  result 
is  shown  in  Fig.  11. 

For  the  gas  mixer,  it  was  found  that  when  the  Reynolds  numbers 
were  within  the  experimental  limits,  no  consistent  correlation 
between  the  standard  deviation  of  the  scalar  field  and  Reynolds 
number  was  detected  for  the  three  injection  arrangements.  This 
observation  seei"S  to  be  in  agreement  with  those  of  Rathgeber  et  al. 
(1983)  and  Kamotani  et  al.  (1972).  The  observation  made  by  those 
investigators,  however,  was  at  a  distance  fairly  close  to  the 
injector  port  where  the  identity  of  the  jet  was  not  lost..  In  the 
present  study,  however,  most  of  the  observations  were  made  in  the 
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far  field  after  the  merging  of  the  jets. 

An  increase  in  Reynolds  number  in  the  gas  mixer  would  bring 
about  two  opposite  effects.  The  first  is  to  increase  the 
turbulence  level  in  the  flow  (which  will  enhance  the  mixing 
process)  ;  the  second  is  to  reduce  the  resident  time  of  the  fluid  in 
the  mixer,  which  tends  to  reduce  mixedness.  The  insensitivity  of 
the  scalar  field  on  Reynolds  numbers  indicates  that  the  two 
opposite  effects  mentioned  above  are  canceling  each  other.  This 
contention  is  in  agreement  with  the  theoretical  analysis  obtained 
by  Beek  et  al.  (1959)  .  In  Beek's  analysis,  the  mixing  was  measured 
by  the  intensity  of  segregation  l^,  a  parameter  referring  to 
fluctuation  with  time  at  a  spatial  point.  While  in  the  present 
work,  the  mixing  is  indicated  by  the  deviation  of  the  average  value 
with  respect  to  the  cross-sectional  average. 

For  the  water  mixers,  the  experimental  data  can  afford  only  a 
comparison  for  a  single  jet  with  Reynolds  number  ranging  from 
12,000  to  30,000.  Such  a  result  is  shown  in  Fig.  12  which  shows  a 
positive  effect  of  increasing  Reynolds  number  on  mixing.  It  must 
be  pointed  out  that  the  number  of  experiments  being  carried  out  is 
not  large  enough  and  thus  the  result  cannot  be  regarded  as  a  solid 
proof  that  the  degree  of  mixing  in  a  liquid  mixer  is  sensitive  to 
Reynolds  numbers,  as  predicted  by  Beek  et  al.  (1959)  .  According  to 
that  analysis  the  increase  in  Reynolds  number  would  reduce  the  time 
constant  t^  at  a  faster  rate  than  the  decrease  in  resident  time  t^ 
thus  the  degree  of  mixing  would  improve  with  increase  in  Reynolds 
number. 
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The  effect  of  Richardson  number  (an  indicator  of  buoyancy 
effect) ,  was  examined  with  the  data  from  the  two  water  mixers  by 
changing  the  concentration  of  the  salt  brine  and  the  size  of  the 
mixer.  Since  the  attainable  density  difference  between  the  primary 
and  the  injected  fluid  is  relatively  small  with  the  present 
experimental  technique,  the  magnitude  of  the  Richardson  number 
covered  in  this  work  is  also  relatively  small.  With  the  density 
ratio  varied  from  1.02  to  1.15  at  a  Reynolds  number  of  16,000,  no 
consistent  correlation  with  Richardson  number  was  found  in  the 
present  work  (Fig.  13)  .  According  to  the  work  of  Forney  et  al. 
(1979) ,  the  buoyancy  effect  would  not  be  significant  if  the 
densimetric  Froude  number  is  larger  than  the  jet-to-primary  fluid 
velocity  ratio,  i.e. 

V: . -  >  Vj 

/gd  pq-p.l~ 

J  po 

or  >  /gd  po~oi  (11) 

y  po 

Since  the  condition  specified  in  Eg.  (11)  is  always  satisfied 
in  this  work,  thus  the  buoyancy  effect  is  probably  not  significant 
enough  to  be  detected  by  the  present  experimental  techniques. 

F.  The  Effect  of  Stanton  Number 

The  Stanton  number  St  is  related  to  the  ratio  of  convection 
heat  transfer  rate  across  the  jet  surface  to  the  heat  carried  by 
the  jet  stream.  In  a  jet  mixer,  the  Stanton  number  is  derived  from 
the  energy  equation  in  the  near  field.  As  is  well  known  in  heat 
transfer  texts,  the  Stanton  number  (h/(pCpV)  is  a  function  of 


15-39 


Reynolds  number  and  Prandtl  number.  In  gas  mixers ,  the  scalar 
field  is  insensitive  to  Reynolds  number,  thus  the  effect  of  Stanton 
number  is  represented  by  the  Prandtl  number  alone.  In  the  case  of 
water  mixer,  the  Reynolds  number  seems  to  have  some  bearings  on  the 
far  field,  thus  to  match  the  Stanton  number  requires  the  matching 
of  both  tne  Reynolds  number  and  Prandtl  number.  Since  the  mixing 
result  in  a  gas  mixer  is  insensitive  to  Reynolds  number,  one  can 
thus  find  the  effect  of  Prandtl  number  by  comparing  the  near-field 
scalar  quantity  in  the  gas  mixer  and  the  water  mixer,  even  though 
the  Reynolds  number  in  the  former  is  not  equal  to  that  in  the 
latter. 

The  mixing  in  the  far  field  depends  on  two  factors,  the  first 
being  the  initial  degree  of  segregation  (represented  by  the  scalar 
quantity  at  the  end  of  the  near  field)  and  the  s.econd  being  the 
time  ratio  t^/t^..  The  time  constant  of  mixing  for  a  gas  mixer  is 
a  function  of  Pr,  and  Re  (see  Eq.  3)  and  a  change  in  Re  would 
also  change  t^.  The  time  ratio,  however,  is  insensitive  to 
Reynolds  number  since  t^  and  t^.  change  with  Reynolds  number  at 
approximately  the  same  manner.  For  the  mass  transfer  process  in 
the  water  mixers,  the  time  constant  t^.  is  a  function  of  Sc,  L^,  Re 
and  V  in  a  manner  depicted  in  Eq.  (4) .  With  Eq.  (4) ,  the  time  t^ 
and  t^  change  differently  with  Re  and  thus  the  time  ratio  is  found 
to  be  dependent  on  Reynolds  number.  Moreover,  the  time  constant 
for  the  mass  transfer  process  in  the  water  mixer  is  much  larger 
than  that  of  the  gas  mixer  because  the  Sc  for  salt  brine  is  745 
(Kays,  1966)  and  the  Prandtl  number  for  air  is  0.7  only.  Thus  by 
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comparing  in  scalar  measurements  in  the  far  field  of  the  gas  and 
water  mixers,  one  can  assess  the  effect  of  Pr  and  Sc  on  mixing. 

The  measured  scalar  field  ^r  the  gas  mixer  and  the  water 
mixers  are  plotted  in  Fig  14  (for  one  jet),  Fig.  15  (for  two  jets), 
and  Fig.  16  (for  four  jets)  .  The  scalar  quantity  in  the  water 
mixers  is  much  non-uniform  in  both  the  near  field  and  the  far 
field.  In  the  near  field,  the  difference  is  caused  by  the 
difference  in  Prandtl  number  (with  Pr  of  water  equal  to  7  and  Pr 
for  air  equal  to  0.7).  In  the  far  field,  the  difference  is  caused 
by  the  difference  in  t^.  For  mass  transfer  in  water,  the  time 
constant  is  much  larger  than  the  time  constant  for  the  heat 
transfer  process  in  the  gas  mixer  (the  Sc  is  more  than  1000  times 
larger  than  Pr) .  This  implies  the  mixing  in  water  is  much  slower 
than  that  in  air.  This  conclusion  is  in  agreement  with  that  give 
by  Simpson  (1972) . 

VI.  VALIDITY  OF  SIMULATION  WITH  WATER  TUNNEL. 

Experiment  evidence  from  the  present  work  and  previous 
investigations  indicate  that  there  are  several  parameters  pertinent 
to  the  results  of  mixing.  For  the  near  field,  the  parameters  are 
1)  diameter  ratio  d/D,  2)  number  of  injection  ports,  3)  momentum 
ratio  J,  and  4)  the  Prandtl  number.  For  the  far  field,  the  mixing 
is  determined  by  1)  the  initial  scale  of  segregation,  2)  the 
dissipation  rate,  3)  the  Prandtl  number  (for  heat  transfer)  or 
Schmidt  number  (for  mass  transfer),  and  4)  the  Reynolds  number 
(for  water  mixer) .  The  effect  of  the  second  group  of  parameters 
can  be  summed  up  by  a  time  ratio  t^/t^.  These  parameters  can  be 
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Fig.  14  Comparison  of  scalar  fi 
(Vi/Vo  -  5,  Re  -  30,000 
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used  as  the  similarity  criteria  to  examine  whether  mass  transfer 
process  in  a  water  mixer  can  be  used  to  simulate  the  heat  transfer 
process  in  an  air  mixer. 

As  far  as  the  near  field  is  concerned,  the  four  dimensionless 
parameters  can  easily  be  matched  between  the  two  types  of  mixers 
except  the  Prandtl  number.  Such  effect  can  be  seen  in  Figs  13,  14 
and  15  when  x/d  =  6.  The  scalar  field  is  much  less  uniform  for 
water  mixer  because  the  mass  diffusion  coefficient  is  much  smaller 
than  the  thermal  diffusivity.  These  results  show  that  water  mixer 
cannot  simulate  the  air  mixer  in  the  near  field. 

In  the  far  field,  the  similarity  parameter  is  the  time  ratio 
tg/tj..  Again  because  of  the  difference  between  Pr  and  Sc,  the  time 
constant  for  mass  transfer  in  a  water  mixar  is  much  larger  than 
that  for  heat  transfer  in  an  air  mixer.  In  other  words,  the  speed 
of  turbulent  mixing  in  water  is  much  slower  than  that  in  gas.  It 
may  first  appear  that  similarity  is  attainable  by  adjusting  the 
length  of  the  mixer  to  achieve  identical  t„/t^.  Such  a  method  would 
be  a  valid  one  if  for  the  two  types  of  mixer  would  remain  the 
same  when  the  operating  conditions  change.  However,  the  analytical 
result  of  Beek  et  al.  (1959)  showed  that  this  was  not  the  case. 

According  to  Beek  et  al.,  the  mixing  distance  (a  distance 
required  to  attain  certain  degree  of  mixing)  in  an  air  mixer  is 
quite  sensitive  to  the  number  of  injection  ports,  but  is  rather 
insensitive  to  the  Re,  ^.olds  number.  On  the  other  hand,  the  result 
of  mixing  in  a  water  mixing  is  quite  different.  It  is  rather 
insensitive  to  the  number  of  injection  ports,  but  very  sensitive  to 
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the  Reynolds  number.  Such  a  difference  is  caused  by  the  difference 
in  the  correlation  between  t^,  and  other  flow  parameters  (see  Eqs. 
(3)  and  (4))  for  the  two  fluids.  Thus  for  a  given  Reynolds  number 
and  injection  ports,  one  may  match  the  time  ratio  of  the  two  mixers 
to  obtain  similarity  in  the  far  field.  However,  similarity  would 
break  down  in  the  same  mixers  if  the  flow  rate  or  the  number  of 
injection  port  is  changed. 

Thus  the  validity  of  using  mass  transfer  process  in  a  water 
mixer  to  simulate  the  heat  transfer  process  in  an  air  mixer  is  open 
to  question.  Indeed,  our  experiment  measurement  has  shown  that  the 
scalar  fields  from  these  two  mixers  are  quite  different. 

VII.  CONCLUSIONS 

The  conclusions  from  the  studies  are: 

1.  Parameters  pertinent  to  the  mixing  in  the  near  field  include: 

a.  the  diameter  ratio  d/D  and  the  number  of  injection  ports 

b.  the  momentum  flux  ratio  pjV^^  /poV^^^ 

c.  the  Prandtl  number  (for  heat  transfer)  and  Schmidt  number 
(for  mass  transfer) .  These  two  parameters  were  found  to 
play  a  significant  role  in  the  spread  of  the  scalar  fields 

2.  The  following  parameters  are  found  to  be  relatively 
unimportant  to  the  mixing  process 

a.  The  Reynolds  number:  within  the  re  ge  of  the  experiment, 

i.--'.,  30000  (  R^  (  88000  for  gas  mixer  and  12000  (  R^  ( 

30000  for  water  mixer,  the  effect  of  Reynolds  number  on 
the  near  field  is  insignificant. 

b.  The  Richardson  number.  When  the  velocity  of  the  primary 
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flow  satisfies  the  following  condition: 

the  effect  of  Richardson  number  is  found  to  be  negligible. 

3.  The  mixing  study  in  the  far  field  is  in  general  agreement  with 
the  prediction  of  Beek  et  al.  (1959)  ,  i.e. 

a.  the  mixing  in  gas  is  sensitive  to  the  number  of 
injection  ports,  but  is  ^ ither  insensitive  to  Reynolds 
number. 

b.  the  mixing  in  liquid  is  ra':.h  2r  insensitive  to  the  number 
of  injection  ports,  but  is  sensitive  to  the  Reynolds 
number. 

4.  The  heat  transfer  process  in  a  gas  mixer  cannot  be  duplicated 
closely  by  the  mass  transfer  in  a  water  mixer;  and  the 
difference  between  the  values  of  the  Prandtl  number  and 
Schmidt  number  is  the  major  cause  of  this  problem. 

VIII.  RECOMMENDATIONS 

It  is  thus  recommended  that  gas  should  be  used  as  the  test 
medium.  If  mass  transfer  is  used  to  simulate  the  process,  then  the 
Schmidt  number  of  the  test  mediums  should  be  close  to  the  Prandtl 
number.  The  present  study  should  be  further  extended  to  include 
the  following  aspects. 

1.  To  measure  the  mixing  distance  required  to  accomplish  the 
desired  degree  of  mixing.  To  do  so  the  set-up  used  in  this  work 
needs  to  be  upgraded,  such  as  increasing  the  length  of  the  mixing 
chamber  and  the  temperature  of  the  secondary  fluid. 
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2.  To  measure  the  fluctuating  scalar  field.  This  measurement 
would  provide  a  direct  comparison  with  Eq.  (5) . 

3.  To  evaluate  the  effect  of  density  ratio.  The  effect  of 
density  is  known  to  be  insignificant  when  the  density  ratio  is  less 
than  2.8.  However,  the  density  ratio  in  a  prototype  mixer  can 
reach  as  high  as  20,  and  the  effect  of  density  ratio  at  the  range 
is  unknown  and  further  experiment  is  required. 
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ABSTRACT 


Preliminary  computational  models  of  the  laser-induced 
fluorescence  spectrum  of  nitric  oxide  have  been  developed  for  two 
different  electronic  transitions: 

B^n  (v'=7)-x2n  (v"=0)  and  D^e  (v'=0)  -x^n  (v»=l) .  These 
codes  have  been  developed  for  a  single-photon  process  with  the 
spectrum  displayed  either  as  a  stick  spectrum  or  displayed  using 
a  Lorentzian,  Gaussian,  or  Voigt  line  shape.  Laser-induced 
fluorescence  spectra  of  nitric  oxide  at  room  temperature  and  at 
high  temperature  (up  to  62  3°K)  with  various  pressures  have  been 
recorded.  Comparison  of  the  experimental  results  with  predicted 
spectra  will  be  presented  herein. 
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I .  INTRODUCTION 

The  NO  molecule  is  a  stable  diatomic  molecular  radical.  The 
molecule  is  a  key  molecule  in  the  chemistry  of  the  upper 
atmosphere  and  an  important  but  dangerous  pollutant.  It  has  also 
been  detected  in  an  interstellar  molecular  cloud.  Moreover,  this 
molecule  is  important  to  combustion  chemistry  and  turbulent 
flows.  Accurate  measurements  of  temperature  and  species 
concentration  provide  the  fundamental  diagnostic  input  to 
characterize  combustion  system  and  turbulent  flows.  It  is 
obvious  that  this  molecule  can  provide  many  interesting  insights 
to  various  fundamental  processes. 

The  development  and  application  of  laser- induced 
fluorescence  as  a  diagnostic  technique  has  been  underway  for 
approximately  twenty  years.  This  experimental  method  can  be  used 
to  make  non-intrusive  measurements  in  combustion  systems  and  arc- 
heated  flow  fields.  Measurements  inside  a  combustion  system  or 
an  arc-heated  flow  can  be  used  to  determine  the  species  density 
and  temperature  as  well  as  to  assign  the  molecular  transitions. 

Study  of  laser-induced  fluorescence  of  nitric  oxide  was 
therefore  decided  on  as  the  subject  of  this  research  project. 
Single  photon  excitation  rather  than  multiphoton  excitation  was 
used.  The  ArF  excimer  laser  system.  Lambda  Physik  EMG  150  MSC, 
installed  at  AEDC  was  used  for  this  project. 
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II.  OBJECTS  OF  THE  RESEARCH  EFFORT 

For  the  overall  research  project,  several  individual  tasks 
were  done.  In  particular,  they  were: 

1.  Development  of  computational  models  for  electronic 
transitions ; 

2.  Observation  of  rotationally  resolved  laser  excitation 
spectra  at  different  temperatures  and  at  various 
pressures ; 

3.  Observation  of  vibrationally  resolved  laser  excitation 
spectra  at  different  temperatures  and  at  various 
pressures. 

III.  ENERGY  LEVELS 

In  order  to  understand  the  behavior  of  the  fluorescence 
signals  that  can  be  affected  by  temperature,  pressure,  and 
spectroscopic  constants,  computational  models  were  needed.  It 
was  decided  to  observe  and  characterize  the  fluorescence  spectra 
at  room  temperature  and  at  high  temperature  (1200'’K). 

According  to  the  relative  intensity  calculations,  it  was 
found  that  the  rotational  transition  of  the  B^n(v'=7)  -X^n(v"=0) 
electronic  transition  would  be  important  at  room  temperature, 
those  of  the  D^E(v'=0)  -X^n(v"=l)  electronic  transition  would  be 
dominant  at  high  temperature,  and  those  of  these  two  electronic 
transitions  would  appear  together  at  moderately  high  temperature. 
Therefore,  three  different  computational  models  of  fluorescence 
spectra  were  needed  for  this  project.  The  important  eguations 
for  the  rotational  energy  levels,  rotational  transition 
strengths,  and  line  shapes  will  be  briefly  described  below. 
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The  total  energy  of  a  diatomic  molecule,  excluding 
translation,  is 

E'^  =  Eg  +  E-y  +  Ej-, 

where  Eg/  Ev,  and  E^-  respectively  stand  for  the  energies  of  the 
electronic  state,  vibratioril  state,  and  rotational  state.  The 


second  two  terms  on  the  rignt  are  well  known.  They  are 
Ev  =  V^iv+h)  -  WgXg(v+i5)^  +Wgyg(v+Js)3, 

and 

Er  =  K(K+l)Bv  -  k2(K+1)2Dv, 


(1) 


where  v  and  K  are  vibrational  and  rotational  quantum  numbers,  Wg 
is  the  vibrational  energy  expressed  in  cm“^.  WgXg  and  WgYg  are 
the  enharmonic  constants,  and  are,  respectively,  the 
rotational  constant  and  centrifugal  distortion  constant  of  the 


vibrational  state  v. 


For  the  NO  molecule  the  electronic  spin  has  the  value  h 
which  produces  doublets.  One  labels  the  two  components  of  the 
doublet  by  subscripts  1  and  2,  so  that 
Fi(K) ;  fi(K) :J  =  K  +  ^, 

and 


F2(K) ;  f2(K) :J  =  K  -  h, 

where  K  and  J,  respectively,  represent  the  pure  rotationa] 
quantum  number  and  the  coupled  quantum  number.  Fi(K)  and  F2(K) 
are  used  to  represent  the  rotational  levels  of  the  electronic 
state,  while  fi(K)  and  f2(K)  for  those  of  the  ^IT  electronic 
state. 

Besides  the  spin  effect,  the  rotational  energy  levels  will 
also  be  affected  by  the  coupling  of  the  spin  and  the  momentum 
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created  by  the  rotation  of  the  nuclei.  The  rotational  energy 
levels  will  thus  be  modified.  They  are: 

Fi(K)  =  K(K+l)Bv-K2(K+l)^Dv+R{K+i5)  , 

2  2 

F2(K)  =  K(K+1)Bv-K'^  (K+l)"^Dv-R(K+i5)  , 
for  the  electronic  state  [1]; 
and 

fl(K)  =  [K(K+1)2-!2  J  4(K+l)^+a(a-4)  ]Bv  -  k2(K+1)2cv, 

(3) 

f2(K)  =  J  4 (K2+a(a-4) ] Bv  -  k2(K+1)2Dv, 

for  the  electronic  state  [2],  where  R  and  a  are  the  coupling 
constants.  They  are  determined  from  the  experimental  spectra. 

For  rotational  transition  probabilities,  two  sets  of 
equations  were  used.  The  formulas  expressed  explicitly  in  terms 
of  the  quantum  numbers  by  Earls  [3]  originally  obtained  by  Hill 
and  Van  Vleck  [4]  were  used  for  the  ^E  -  electronic 
transitions.  They  are: 

2J+1  *  ((2J+1)±  U( (2J+l)^-2a) ) 

2J+2 

2J-H  *  ((2J+1)±  U(  (2J-H)^+2  (a-4)  )  ) 

2J+2  J  ' 

2J+1  *  [(2J+l)^-2  ±  U( (2J-H) 3-8J-2a)  ] 

2J+2  J 

2J+1  *  [(2J+l)^-2  ±  U( (2J+1) 3-8J+2 (a-4)  )  ] 

2J+2  J 


"^21 

Rl 

*312 

Q2 

^21 

Qi 

^12 
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2J4-1  *  ((2J+1)  U(  (2J+1)  2-2a)  ] 
2J 


Pi 

P2  ^  2J+1  ^  ((2J+1)  U(  (2J-H)  2+2  (a-4)  ) 

Q21  ■  2J 

where  =  J  (2J+1) ^  +  a {a-4) 

and  a  is  a  coupling  constant.  The  +  sign  is  used  for  the  main 
branches  such  as  R2,  Ri,  etc.;  while  the  -  sign  is  used  for  the 
satellite  branches,  such  as  S21,  Qi2»  etc.. 

The  Hond-Londcn  formulas  for  AA  =  0  transitions  listed  in 
the  "Spectra  of  Diatomic  Molecules"  [5]  were  used  for  the 


electronic  transitions.  They  are; 

^  (J+l+A)  *  (J+l+A) 


(2J+1)  *  A 
J(J+1) 

(JtA)  *  (J-A) 


(5) 


where  J  and  (A=l)  respectively  stand  for  the  coupled  rotational 
quantum  number  and  electronic  state.  There  are  six  rotational 
transitions  within  these  two  p'ectronic  states; 

Pll'  ^22/  Qllf  Q2:u  P22- 

The  molecular  concentration  distribution  is  an  important 

facv-or  in  the  evaluation  of  tne  transition  intensities.  For  a 

given  electronic  state,  uhe  gas  density  of  the  rotational  state  K 

in  the  vibrational  state  is 

N(2K+1) 

=  -  o  o  +  Ev)hc/kT,  (6) 

Wr^v  ® 


where  Ev  is  the  vibrational  energy,  is  the  rotational  energy, 

T  is  the  temperature,  N  is  the  totaJ  gas  density  (molecules/cm^ ) , 


and  and  are  the  rotational  and  vibrational  state  partition 
functions  defined  as 

Qr  .  (2K+1)  e-BvK(K-^l)ho/kT_ 

and 

Q„  =  E 
^  v=0 


Here  the  term  stands  for  the  vibrational  energy  of  the  v 
state.  The  transition  intensity  factor  is  given  by  the 
product  of  Eqs.  (4)  or  (5)  and  (6) . 

The  Voigt  line  shape  applies  when  both  Doppler  and  pressure 
broadenings  are  significant,  for  example,  when  the  experiment  is 
carried  out  at  high  temperature  with  moderate  pressure.  Typical 
combustion  applications  are  of  this  nature,  therefore  the  Voigt 
line  shape  function  will  be  used  in  this  work.  Several  important 
expressions  of  this  function  will  be  briefly  described.  The 
transition  coefficient  at  a  wavenumber  due  to  a  transition  of 


wavenumber  A  is  given  as 


where 


li(^)  =  II  f  ekp(-t2) 

^  ’  J-»  (Xi-t)2+Yi2 


with  Pi  the  transition  intensity  factor  mentioned  previously  and 


oAi  is  the  Doppler  half-width  given  by 


and  '/i  is  the  Lorentz  half-width  determined  from  experiment; 
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Xi  =  ^[ln2  ; 

and  t  is  a  real  number.  The  net  coefficient  at  A  will  be 
I  (A)  =  Ili(A)  . 

IV.  COMPUTATIONAL  MODELS 

Three  codes  (PPTFST.FOR,  DPTEST.FOR,  and  TEST. FOR)  were 
developed  in  this  project,  and  they  were  used,  respectively,  to 
predict  the  fluorescence  signal  of  the  B^n(v'=7)  -  X^n(v"=0) 
electronic  transition  at  room  temperature,  the  fluorescence 
signal  of  the  (v'=0) -X^n  (v’'=l)  electronic  transition  at  high 
temperature,  and  that  of  the  combination  of  these  two  electronic 
transitions  at  moderately  high  temperature.  In  principle,  the 
structure  and  optional  functions  of  the  first  two  computational 
codes  are  the  same.  First  of  all,  the  rotational  energy  levels 
in  both  electronic  states  were  calculated.  Secondly,  the 
excitation  frequency  and  transition  intensity  factor  of  each 
rotational  line  were  calculated.  Besides  the  transition 
probability,  the  Boltzmann  distribution  and  Franck  Condon  Factors 
[6]  were  considered.  Because  of  the  same  lower  energy  level 
electronic  state,  the  partition  functions  for  rotational  and 
vibrational  states  were  not  used  here.  Third,  the  emission 
transition  with  its  transition  probability  were  calculated  for 
higa  energy  levels.  Each  rotational  energy  level  from  the  upper 
electronic  state  will  provide  either  six  (for  D^E  -  X^n  in 
PPTEST.FOR)  or  three  (for  B^fl  -  X^II  in  DPTEST.FOR)  emission 
transitions.  This  means  that  the  number  of  molecules  in  the 
higher  energy  level  will  be  split  into  three  or  six  parts. 
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Forth,  the  emission  transition  wavelength  (or  wavenumber)  may  be 
selected  in  a  proper  range  for  both  prediction  and  plot.  Each 
code  has  a  function  to  generate  the  information  for  plotting. 

Four  line  shapes  were  used  in  each  code.  They  were  stick, 
Lorentzian,  Gaussian,  and  Voigt  line  shapes.  The  stick  spectrum 
only  indicates  the  wavelengths  and  intensities.  The  procedures 
for  the  second  and  third  are  straightforward.  However,  that  of 
Voigt  line  shape  is  relatively  tedious.  Some  important  steps 
will  be  briefly  explained  below. 

The  integral  term  is  the  most  important  in  Voigt  line  shape. 
The  magnitudes  of  X^,  Y^,  and  t  play  an  important  role  in  this 
part.  The  magnitude  of  depends  upon  the  Lorentz  half-width  7^ 
and  Doppler  half-width  5Ai.  The  former  only  depends  upon  the 
experimental  date,  while  the  latter  depends  upon  the  transition 
wavenumber  and  temperature.  For  a  given  temperature  and  the 
short  wavenumber  regioii,  the  change  of  is  small,  so  is  that 
of  Yi.  The  nearly  constant  value  of  Y^  can  avoid  a  very  small 
magnitude  in  the  denominator.  Besides  the  Doppler  half-width, 
the  magnitude  depends  on  the  difference  between  thf.  individual 
transition  wavenumbers  and  the  wavenumbers  in  the  selected 
region.  The  integrand  is  an  even  function  of  t  (real  value) . 

The  magnitude  is  always  positive,  and  gradually  decreases  from 
the  symmetric  point  (tc^O)  .  For  convenience,  a  summation 
procedure  was  used  to  replace  the  integration.  It  was  found  that 
the  increment  in  t,  At,  and  the  upper  and  lower  limits  could 
affect  the  calculate  values  of  the  integral.  For  a  given  set  of 
and  Yj^  the  calculating  procedure  is  stopped  when  the 
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multiplied  magnitude  of  the  integrand  value  and  increment  is  less 
than  l.OE  -6.  Different  values  of  At  were  used  to  check  the 
calculated  value  in  the  summation  process.  When  At  was  less 
than  0.03,  less  than  a  1  percent  change  in  the  calculated  value 
in  the  integral  was  found,  and  the  value  of  At=0.03  was  used  in 
this  work.  The  upper  and  lower  limits  in  the  summation  were 
found  to  be  around  ±3.0. 

The  procedure  described  up  to  this  point  applies  only  for  a 
given  wavenumber  associated  with  a  particular  transition.  The 
same  procedure  is  then  used  for  the  rest  of  the  transitions.  The 
summation  of  the  individual  contribution  gives  the  effective 
spectral  intensity  factor  at  that  given  wavenumber  (date  point) . 
The  effective  spectral  intensity  must  then  be  calculated  for  each 
wavenumber  in  the  wavenumber  region.  The  selected  wavenumber 
region  was  from  51600  cm”^  to  51880  cm”^  in  this  project,  and  the 
wavenumber  separation  was  selected  a  0.06  cm“^  after  a  number  of 
tests.  This  produces  about  five  thousand  data  points,  which  can 
provide  a  very  smooth  plotted  spectrum.  The  procedure  was  used 
for  both  codes  PPTEST.FOR  AND  DPTEST.FOR.  Attempt  was  tried  to 
established  one  code  for  both;  however,  substantial  differences 
in  calculation  of  transition  intensity  factors  and  rotational 
energy  levels  made  it  difficult.  On  the  other  hand,  it  was  easy 
to  combine  the  fluorescence  rotational  transitions  and  intensity 
factors  respectively  calculated  in  both  codes  and  tc  plot  the 
predicted  spectra.  The  code  TEST. FOR  was  used  for  this  case. 
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V.  EXPERIMENTAL  RESULTS 

a.  Rotationally  resolved  laser  excitation  spectnu"  at 
room  temperature 

In  order  to  understand  the  effect  of  pressure  on  the 
spectrum,  various  pressures  were  used.  The  laser 
was  scanned  from  192. 8nm  to  193. 8nm  to  generate  the 
electronic  rotational  transitions.  A  spectrometer 
set  at  208nm  and  also  a  PMT  with  a  band  pass  filter 
were  used  to  record  the  rotational  spectra.  The 
spectra  at  pressures  of  0.5  torr  and  10.0  torr 
recorded  by  the  PMT  are  shown  in  Figures  1  and  2, 
respectively.  It  is  easy  to  see  that  the  four 
groups  of  transitions  were  only  slightly  affected  by 
pressure,  and  the  transitions  marked  with  (*)  were 
significantly  affected  by  pressure.  The  four  groups 
are  due  to  the  B^n  (v'=7)“X^n  (v”=0)  transitions. 

The  predicted  spectrum  with  an  assumed  Voigt  line 
shape  is  shown  in  Figure  3 .  Comparison  of  these 
three  figures  indicates  that  the  observed  and 
predicted  spectra  are  consistent.  The  spectrum  in 
the  same  region  has  been  reported  previously  by 
Wodtke  et  al[7].  For  convenience,  the  predicted 
transitions  with  wavelengths  are  listed  in  Table  I. 
The  line  strengths  of  the  Q-branch  transitions  are 
inversely  proportional  to  the  rotational  quantum 
numbers;  these  lines,  therefore,  are  weak  in  this 
wavelength  region. 


17-13 


The  spectroscopic  constants  previously  determined 

[8]  were  used  without  any  change  for  our  models. 

They  are  listed  in  Table  II. 

Rotationally  resolved  laser  excitation  spectrum  at 
high  temperature 

Mixtures  of  NO  and  N2  were  sent  through  an  electri¬ 
cally  heated  tube  to  the  cell.  The  NO  in  the 
mixture  was  approximately  5%  or  less.  Various 
temperatures  were  reached,  and  the  rotationally 
resolved  laser  excitation  spectra  at  different 
temperatures  were  recorded.  The  recorded  spectrum 
at  the  highest  temperature,  623“K,  is  shown  in 
Figure  4. 

Comparing  Figure  4  with  Figures  1  and  2,  indicates 
that  the  aforementioned  four  groups  were  slightly 
changed  whereas  those  marked  with  (*)  were  changed 
more.  In  order  to  understand  more  about  the 
observed  spectra,  the  predicted  spectra  with  a  Voigt 
line  shape  were  calculated  for  the  (v'=0)-X^n 
(v''=l)  transition.  According  to  the  predicted 
spectrum  shown  in  Figure  5,  most  of  the  transitions 
marked  with  (*),  shown  in  Figure  2  were  due  to  this 
electronic  transition.  For  this  electronic 
transition,  in  addition  to  the  Pll,  Qll,  RI  ,  P22, 
Q22,  and  R22  transitions  reported  by  Andresen  et  al 

[9] ,  the  P12,  Q12,  Q21,  S21,  R21,  and  012 
transitions  are  also  important  as  indicated  by 
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Scheingraber  and  Vidal  [10].  The  assumed 
spectroscopic  constants  for  these  two  electronic 
states  are  listed  in  Table  II,  and  the  predicted 
transitions  with  wavelengths  are  listed  in 
Table  III.  The  calculated  spectra  from  both 
PPTEST.FOR  and  DPTEST.FOR  are  consistent  with  the 
published  measurements  [11].  The  plotted  spectrum 
at  high  temperature  from  DPTEST.FOR  is  also 
consistent  with  the  observed  spectrum  [9]. 
c.  vibrationally  resolved  fluorescence  transitions  at 
room  temperature 

For  fluorescence  spectra,  besides  the  rotational 
transitions,  the  vibrational  transitions  between  two 
electronic  states  are  also  important.  In  order  to 
understand  the  effects  of  pressure  and  temperature 
on  the  vibrational  transitions,  various  pressures 
and  temperatures  were  used  in  this  research.  The 
laser  wavelength  was  fixed  at  any  of  the  observed 
lines  listed  in  Table  I.  The  spectrometer  was 
scanned  slowly  from  the  laser  wavelength  to  330nm  to 
record  the  vibrational  transitions.  A  rotational 
transition  at  the  fixed  input  laser  wavelength  was 
excited,  producing  the  vibrational  fluorescence 
spectrum.  The  vibrational  spectra  at  pressures  of 
1.0  torr,  and  10.0  torr  at  laser  wavelength  set  at 
193.30nm  are  shown  in  Figures  6  and  7,  respectively. 
Of  the  transitions  listed  in  Table  I,  one  or  more  of 
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four  transitions  Rn  (29  29^5  -*  30  30^)  , 

Pll  (26  26^5  -  25  25^),  R22  (28  21\  ^  29  28^5),  and 
P22  (25  24^  -»  24  23^)  were  excited.  The 
vibrational  fluorescence  spectrum  shown  in  Figure  6 
is  consistent  with  the  previously  reported  one  [12]; 
however,  some  expected  vibrational  transitions  are 
still  weak  in  Figure  6  due  to  effects  of  pressure. 
The  high  pressure  vibrational  spectrum  shown  in 
Figure  7  shows  more  vibrational  transitions  and 
stronger  intensities.  This  is  not  surprising, 
because  increasing  the  pressure  increases  the 
molecular  density,  and  hence  the  intensities, 
d.  Vibrationally  resolved  fluorescence  transitions  at 
high  temperature 

The  process  used  to  heat  the  cell  is  the  same  as 
mentioned  previously.  The  vibrational  laser  induced 
fluorescence  spectra  at  different  temperatures  were 
recorded.  The  one  at  623'’K  with  laser  ’’"v^length 
fixed  at  193.30nm  is  shown  in  Figure  It  is 

obvious  that  the  spectrum  has  even  more  transitions 
and  even  stronger  intensities  than  that  at  room 
temperature.  Because  the  rotational  quantum  numbers 
shown  in  Table  I  are  moderately  high,  the  population 
in  these  rotational  energy  levels  should  not  be 
large  at  room  temperature.  When  the  temperature  is 
increased,  however,  the  population  in  these  high 
energy  levels  will  be  increased.  Moreover,  the 
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population  in  the  rotational  energy  levels  of  the 
other  vibrational  state  (v''=l)  will  be  increased  at 
high  temperature.  Consequently  the  transitions 
intensities  will  be  increased. 

VI .  RECOMMENDATIONS 

Comparison  of  the  experimental  results  recently  obtained  at 
AEDC  with  published  reports  [7,9,12]  shows  that  the  laser  system 
works  properly.  Several  experimental  projects  can  be  finished  at 
AEDC  in  the  near  future  using  this  laser  system.  In  particular, 

1.  Fluorescence  spectra  of  rotational  and  vibrational 
transitions  at  higher  temperatures.  So  that  the  basic 
experimental  results  can  be  extended  to  temperature 
regions  of  interest  to  DoD.  Those  results  will  provide 
useful  information  for  basic  scientific  research. 

2.  Fluorescence  spectra  of  arc-heated  flows  at  AEDC. 
Determination  of  temperature  and  molecular  density  is  of 
importance  for  the  mission  of  AEDC.  Fluorescence 
spectra  can  be  used  for  determination  of  temperature  and 
molecular  density.  When  finished,  these  experimental 
results  will  provide  useful  information  for  both  basic 
research  and  diagnostic  applications. 

3.  Besider'  nitric  oxide,  the  laser  system  can  be  used  to 
investigate  the  spectra  of  other  molecules,  such  as  OH. 
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Figure  5.  Predicted  rotational  spectrum  of  NO  at  SOO'K.  Tt^e 
spectrum  is  the  combination  of  the  B  n(v'=7)  —  X  n 
(v''=0)  and  d2e(v'=0)  -  electronic 

transitions. 


Wavelength  (nm) 

Figure  6.  Observed  vibrationally  resolved  fluorescence 

spectrum  of  NO  at  room  temperature  and  a  pressure 
of  1.0  torr  with  A  =  193.30nm. 
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Figure  7.  Observed  vibrationally  resolved  fluorescenc 
spectrum  of  NO  at  room  temperature  and  a 
pressure  of  10.0  torr  with  A  =  193.30nm. 


Figure  8.  Observed  vibrationally  resolved  fluorescence 
spectrum  of  NO  at  temperature  of  623°K  and 
A  =  193.30nm. 


Table  I.  Predicted  Rotational  Spectrum  for  the  B'lT  {u'  —  7)  — X'!!  {i/"  =  0) 
transitions  of  Nitric  Oxide  between  192. S  and  193. S  nm. 


lower  rotational  energy  level 

wavelength  (nm) 

Rll(29  29  1/2) 

192.985 

Pll(26  26  1/2) 

192.993 

R22(2S  27  1/2) 

193.015 

P22(25  24  1/2) 

193.031 

Rll(30  30  1/2) 

193.128 

Pll(27  27  1/2) 

193.136 

R22(29  2S  1/2) 

193.151 

P22{26  25  1/2) 

193.165 

Rll(31  31  1/2) 

193.273 

Pll(28  28  1/2) 

193.284 

R22(30  29  1/2) 

193.289 

P22(27  26  1/2) 

193.305 

Rll(32  32  1/2) 

193.428 

R22(31  30  1/2) 

193.434 

Pll(29  29  1/2) 

193.437 

P22(2S  27  1/2) 

193.451 

R22(32  31  1/2) 

193.5S4 

Rll(33  33  1/2) 

193.5S6 

Pll(30  30  1/2) 

193.595 

Pll(30  30  1/2) 

193.601 

i/-.:5 


Table  II.  Assumed  Spectroscopic  Constants  of  Nitric  Oxide. 
Units  are  in  cm~l 


=  7)  state 
Te:  45868.53 
We:  1037.45 
WeXe:  7.47 
WeYe:  0.073 
Be:  1.031 
De:  4.9  x  10“® 

=  0)  state 
Te:  123.16 
We:  1904.4 
WeXe:  14.19 
WeYe:  0.024 
Be:  1.696 
De:  5.3  x  10"^ 


D2X(C  =  0)  state 
Te:  53030.00 
We:  2323.90 
WeXe:  22.90 
WeYe:  0 
Be;  1.987 
De:  5.8  x  10"^ 

X^n(z/"  =  1)  state 
Te:  122.95 
We:  1904.4 
WeXe:  14.19 
WeYe:  0.024 
Be:  1.686 
De:  5.3  x  10"® 


4 


i 


i 
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Table  III,  Predicted  Rotational  Spectrum  for  tlie  =  0)-,X-n(i/"  =  i) 

transitions  of  Nitric  Oxide  between  192.8  and  193.8  nm. 


lower  rotational  energy  level 

wavelength  (nm) 

Qll(35  35  l/2)+Q21(35  35  1/2) 

193.142 

Q22(37  36  1/2)+Q12(37  36  1/2) 

193.157 

R22(31  30  1/2) 

193.179 

P22(43  42  l/2)+Pl2(43  42  1/2) 

193.209 

Pll(41  41  1/2) 

193.211 

S21(23  23  1/2) 

193.213 

Rll(28  28  l/2)+R21(28  28  1/2) 

193.222 

Qll(34  34  l/2)+Q21(34  34  1/2) 

193.223 

Q22(36  35  l/2)+Ql2(36  35  1/2) 

193.246 

R22(30  29  1/2) 

193.268 

Pll(40  40  1/2) 

193.293 

S21(22  22  1/2) 

193.293 

P22(42  41  l/2)+Pl2(42  41  1/2) 

193.298 

Rll(27  27  l/2)+R21(27  27  1/2) 

193.300 

Qll(33  33  l/2)+Q21(33  33  1/2) 

193.301 

Q22(35  34  l/2)+Ql2(35  34  1/2) 

193.332 

R22(29  28  1/2) 

193.354 

S21(21  21  1/2) 

193.370 

Pll(39  39  1/2) 

193.372 

Rll(26  26  l/2)+R21(26  26  1/2) 

193.375 

Qll(32  32  l/2)+Q21(32  32  1/2) 

193.378 

P22(41  40  l/2)+P12(41  40  1/2) 

193.385 

Q22(34  33  l/2)+Ql2(34  33  1/2) 

193.417 

R22(28  27  1/2) 

193.439 

S21(20  20  1/2) 

193.445 

Rll(25  25  1/2)4-1121(25  25  1/2) 

193.447 

Pll(38  38  1/2) 

193.449 

Qll(31  31  l/2)-pQ21(3l  31  1/2) 

193.450 

P22(40  39  l/2)  +  P12(40  39  1/2) 

193.469 

Q22(33  32  l/2)+Q  12(33  32  1/2) 

193.498 

Rll(21  24  l/2)  +  R21(24  24  1/2) 

193.517 

S21(10  19  1/2) 

193.518 

011(30  30  1/2) -1-02 1(30  30  1/2) 

193.520 

R22(27  26  1/2) 

193.521 

i’l  1(37  37  1/2) 

19.3.52.3 
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ABSTRACT 


One  of  the  most  important  molecules  that  can  be  used  to 
monitor  exhaust  gases  via  spectroscopic  means  is  H2O.  To 
accomplish  this  task,  one  needs  to  be  able  to  model  the  spectrum 
of  hot  H2O.  Despite  the  complexity  of  this  problem,  we  have  made 
substantial  progress  by  deriving  a  theory  for  the  absorption  of 
radiation  by  the  far  wings  of  spectral  lines.  Initial  validation 
of  these  theoretical  results  have  been  carried  out  in  the  medium 
infrared  spectral  region,  and  the  agreement  between  theory  and 
experiment  is  generally  very  good.  The  relevance  of  this  work  to 
the  determination  of  temperatures  is  discussed. 


I .  INTRODUCTION 


In  order  to  ascertain  information  about  temperature 
gradients  and/or  concentration  distributions  of  molecular  species 
in  exhaust  gases  using  spectroscopic  techniques,  one  needs  to 
know  (as  a  minimum)  the  positions  and  strengths  of  the  radiative 
transitions  involved.  For  many  species  this  information  is  not 
available  directly  from  laboratory  measurements  because  of  the 
conditions  under  which  most  experiments  are  performed,  viz. 
relatively  low  temperatures.  The  extrapolatron  of  this 
laboratory  data  to  the  high  temperatures  existing  in  exhaust 
gases  is  not  a  simple  problem,  especially  for  polyatomic 
molecules,  because  of  the  existence  of  both  highly  excited 
rotational  levels  and  a  large  number  of  hot  bands  that  are 
present  in  the  spectra  of  hot  gases.  In  the  present  report  we 
shall  describe  briefly  some  progress  that  has  been  made  in 
compiling  a  spectroscopic  data  base  that  would  enable  one  to  model 
the  spectra  of  several  exhaust  species. 

One  of  the  main  molecules  of  interest  is  H2O.  However, 
because  of  the  complexity  of  the  problem  of  modeling  the  spectrum 
of  hot  water  vapor,  an  adequate  data  base  is  not  currently 
possible,  although  substantial  progress  has  been  made  as  will  be 
discussed  later.  On  the  other  hand,  accurate  frequency  and 
intensity  data  for  several  Isotopic  species  of  CO  were 
calculated;  these  would  enable  one  to  model  the  vibration- 
rotational  spectrum  of  CO  up  to  several  thousand  degrees.  This 
data  was  already  made  available  to  my  research  colleagues  at 
Arnold  Development  Center,  and  it  will  be  included  in  the  HITEMP 
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data  base  currently  being  assembled  at  the  Air  Force  Geophysics 
Laboratory,  Hanscom  Air  Force  Base.  A  similar  data  base  for  the 
Schuman-Runge  electronic  transitions  of  O2  has  been  completed  and 
made  available  to  the  workers  at  Arnold  Development  Center. 

In  view  of  the  above,  rather  than  reviewing  this  work  on 
diatomic  molecules  that  has  been  completed,  we  will  focus  mainly 
on  the  progress  that  has  been  made  on  the  problem  of  hot  water 
vapor.  In  the  following,  we  will  first  describe  work  done  in 
collaboration  with  a  graduate  student  (Q.  Ma)  on  the  calculation 
of  atmospherically  important  "water  continuum".  The 
extrapolatron  to  high  temperatures  and  its  relevance  to  the 
monitoring  of  exhaust  gases  will  also  be  briefly  discussed. 

Then  we  will  discuss  some  recently  completed  experimental  work 
and  theoretical  analysis  carried  out  in  collaboration  with  J.-M. 
Hartmann  of  the  Laboratoire  d ' Energetique  Moleculaire  et 
Macroscopique  Combustion,  I'Ecole  Centrale  in  Paris,  Frnnce. 
Together  this  body  of  work  constitutes  an  important  step  towards 
the  goal  of  a  quantitative  understanding  of  the  spectrum  of  hot 
water  vapor. 

II.  THE  WATER  SPECTRUM 

A.  The  "Water  Continuum" 

The  conti  uum  absorption  of  radiation  in  the  infrared 
windows  of  the  Earth's  atmosphere  has  been  known  for  a  long 
time.^  The  history  of  this  topic,  together  with  many  references 
to  earlier  work,  has  been  reviewed  recently  by  a  number  of 
authors. There  is  nearly  unanimous  agreement  on  the 


temperature  dependence  (strong,  negative),  but  there  is 
considerable  disagreement  as  to  its  magnitude  and  the  physical 
mechanism  responsible  for  the  absorption.^  Water  dimers, 
collision-induced  absorption,  and  the  superposition  of  the  far 
wings  of  collisionally  broadened  lines  have  all  been  proposed  as 
possible  candidates.  While  the  first  two  mechanisms  are 
undoubtedly  present  and  contribute  to  the  absorption  (in  varying 
amounts  in  the  different  spectral  regions),  it  is  now  generally 
accepted  that  the  major  contribution  to  the  absorption  coefficient 
in  the  spectral  region  between  300-1100  cm~^  is  due  to  the  wings 
of  the  strong  self-broadened  pure  rotational  transitions  of 
water. 

We  have  derived  a  theory  valid  in  the  far  wings  of  water 
transitions  that  has  as  input  only  well-known  molecular 
parameters;  this  theory  enables  us  to  calculate  the  absorption 
due  to  overlapping  of  far  wings  from  the  millimeter  region 
through  the  infrared.^’®  The  results  of  such  a  calculation  are 
shown  in  Figure  1  for  three  temperatures:  T  =  296  K  (solid 
curve),  T  =  338  K  (dashed  curve),  and  T  =  430  K  (dotted  curve). 

We  have  also  carried  out  similar  calculations  for  higher 
temperatures  (T  =  760  K)  more  relevant  to  exhaust  gases. 

Obviously,  the  temperature-dependence  of  these  results  could  be 
used  in  conjunction  with  broadband  measurements  in  the  regions 
between  the  vibration-rotational  bands  (eq.  near  1000  cm“^, 

2800  cm”^,  etc.)  to  determine  the  temperature  of  hot  water  vapor. 
More  work  is  needed,  however,  to  verify  our  results  using 
laboratory  data,  before  such  a  quantitative  comparison  could  be 
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made.  Initial  attempts  in  the  medium,  infrared  region  will  now 
be  described. 

B.  Medium  Infrared  Measurements  and  Analysis 

Experimental  measurements  have  been  made  in  the  high 
frequency  wings  of  the  ^^2*  ^3^  -triad  for 

temperatures  and  pressures  in  the  500-900  K  and  0-70  atmospheres 
ranges.  Because  these  results  will  be  published  in  the  near 
future,^  we  will  not  give  the  experimental  details  here,  rather 
we  will  concentrate  on  the  results.  In  Figure  2  we  present 
experimental  results  for  the  continuum  absorption  (circles)  in 
the  400-900  cm~^  frequency  range.  The  dotted  curve  gives  the 
results  calculated  from  our  theory  to  be  compared  with  the  dashed 
curve  which  one  would  obtain  using  the  conventional  Lorentz 
lineshape;  clearly  the  new  results  are  a  vast  improvement  and 
lead  to  excellent  agreement  at  high  frequencies  where  the 
assumptions  in  the  theory^’®  are  valid.  Also  shown  is  an 
improved  fit  (solid  line)  based  partly  on  the  laboratory  data. 
Similar  results  have  been  obtained  in  other  spectral  regions.^ 

III.  DISCUSSION  AND  CONCLUSIONS 

Substantial  progress  has  been  made  in  deriving  a  theory  for 
the  calculation  of  the  absorption  by  hot  water  vapor,  primarily 
in  the  regions  between  the  strong  vibration-rotational  bands.  We 
have  validated  the  theoretical  calculations  by  comparison  with 
new  laboratory  data  obtained  at  elevated  temperatures  and 
pressures.  By  such  comparisons,  one  can  improve  the  results 
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and  model  quite  accurately  the  absorption  spectrum.  Because  of 
the  strong  temperature  dependence  of  the  absorption,  one  could  in 
principle  obtain  temperatures  from  the  measured  absorption. 

Despite  the  success  of  our  results, much  work  still 
remains;  in  particular,  the  calculation  of  the  absorption  due  to 
individual  vibration-rotational  transitions  that  would  enable  one 
to  obtain  density  and  temperature  data  from  high  resolution 
measurements  as  opposed  to  broad-band  ones.  Only  when  such 
results  are  available  can  they  be  combined  with  the  absorption 
arising  from  the  far  wings  in  order  to  predict  quantitatively  the 
spectrum  of  hot  water  over  a  wide  range  of  densities  and 
temperatures . 
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Abstract 


This  report  describes  the  research  accomplished  under  the  USAF-UES  Summer  Research 
Program  (SRP)  mini-grant  awarded  for  the  period  1  January  1990  to  31  December  1990.  The 
main  goal  of  this  research  was  the  analysis  of  the  dynamic  transient  properties  of  a  simple 
class  of  structurally  adaptive  signal  processing  systems.  The  work  performed  on  this  grant 
shows  that  even  a  very  simple  type  of  system  adaptation  (one  so  simple  that  it  would  not 
ordinarily  be  considered  a  structural  adaptation)  will  usually  require  a  dynamic  modification 
of  the  processing  structure  in  order  to  avoid  an  undesirable  transient  response  to  the  change. 
Additionally,  it  is  possible  to  predict  when  an  undesirable  transient  will  occur,  and  thus  to 
perform  the  structural  modification  to  avoid  it. 


19-2 


1  Acknowledgements 


I  would  like  to  thank  the  Air  Force  Systems  Command  and  the  Air  Force  Office  of  Scientific 
Research  for  sponsoring  this  research.  I  would  also  like  to  thank  Universal  Energy  Systems 
for  their  help  in  the  administrative  aspects  of  the  program.  Many  thanks  also  to  Arnold 
Engineering  Development  Center(AEDC)/DOTR  for  their  help  and  gracious  use  of  their 
equipment. 

This  work  is  also  heavily  dependent  on  the  work  of  Capt.  Ted  Bapty  for  his  help  and 
guidance  at  AEDC  and  Andrew  Koffman  for  his  help  in  performing  the  transient  analysis  of 
structural  adaptation.  Also,  I  must  thank  Dr.  Janos  Sztipanovits  for  his  guidance  in  leading 
me  into  the  area  of  structurally  adaptive  signal  processing  and  for  producing  his  brainchild, 
MULTIGRAPH. 


19-3 


2  Introduction 


A  complete,  parallel,  fully-integrated  programming  and  execution  environment,  the  Multi¬ 
graph  Programming  and  Execution  Environment  (MPEE),  for  structurally  adaptive  signad 
processing  systems  has  been  developed  at  Vanderbilt  University  [1]-[15].  This  environment 
provides  a  user-friendly  environment  for  designing,  programming,  and  executing  a  signal  pro¬ 
cessing  system  having  the  ability  to  modify  its  own  processing  structure  while  it  is  running. 
This  represents  a  significant  generalization  of  the  present  level  of  adaptive  systems  which 
merely  adjust  parameters  within  a  fixed  structure. 

It  is  possible  to  design  and  implement  structurally  adaptive  systems  with  existing  pro¬ 
gramming  languages  such  as  C  or  Fortran,  but  the  Multigraph  system  has  several  important 
advantages: 

•  It  is  a  generic  environment  for  designing  and  implementing  processing  systems  that 
are  representable  in  a  block  diagram  or  signal  flow  graph  form.  Therefore,  the  user  is 
able  to  organize  his  thoughts  in  a  natural  way  around  the  flow  of  the  signal  through 
the  system,  rather  than  in  the  unnatural  methods  of  subroutine  and  function  calls  so 
prevalent  in  standard  programming  languages. 

•  Also,  in  an  ordinary  programming  environment,  the  user  must  provide  his  own  mech¬ 
anism  for  deciding  which  processing  tasks  must  be  performed  and  in  what  order  they 
are  to  be  performed  (i.e.,  the  scheduling  of  processing  tasks  in  the  block  diagram  or 
signal  flow  graph).  In  the  MPEE  scheduling  is  simplified,  since  the  final  execution  envi¬ 
ronment  has  a  macro- dataflow  computational  structure,  and  thus  closely  resembles  the 
signal  flow  graph  of  the  processing  system.  The  result  is  that  the  execution  environment 
can  dynamically  determine  the  proper  scheduling  of  the  processing  tasks  on  its  own, 
and  free  the  user  from  this  tedious  task.  Actually,  in  a  structurally  adaptive  system 
this  task  is  more  than  just  tedious,  it  is  critical.  Since  the  overall  application  system 
can  change  its  processing  graph  structure,  the  schedule  will  change  during  run-time. 
This  means  that  the  dynamic  scheduling  capability  of  the  execution  environment  is 
crucial  for  the  proper  operation  of  the  overall  system.  This  capability  would  generally 
be  difficult  to  implement  with  standard  programming  languages,  and  would  require  the 
generation  of  a  dynamic  scheduling  system  for  each  new  application  program. 

•  In  addition,  the  macro- dataflow  computational  model  realized  by  the  execution  environ¬ 
ment  lends  itself  very  well  to  parallel  multiprocessor  implementations,  with  the  result 
that  such  implementations  have  been  successfully  tested. 

•  An  additional  advants^e  of  the  MPEE  is  its  hierarchical  approach  to  designing  large- 
scale  systems.  The  design  and  implementation  of  flexible  large-scale  signal  processing 
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systems  can  become  so  unwieldy  that  standard  programming  languages  quickly  be¬ 
come  impractical.  The  MPEE  is  well-suited  to  large-scale  systems,  because  higher-level 
graphic-oriented  modeling  facilities  for  signal  processing  structures  are  provided  in  the 
programming  environment.  Essentially,  large-scale  signal  processing  systems  can  be 
designed  by  “drawing  pictures”  on  the  computer  monitor. 

The  MPEE  provides  a  nearly  ideal  environment  for  designing  and  implementing  struc¬ 
turally  adaptive  signal  processing  systems,  but  more  work  needs  to  be  done  to  analyze  the 
signal  and  system  theoretical  behavior  of  this  class  of  adaptive  system.  In  particular,  the 
transient  behavior  of  such  systems  must  be  analyzed.  The  work  reported  herein  is  a  step 
toward  addressing  this  need. 

3  Research  Goals 

The  main  goal  of  this  effort  was  to  investigate  the  signal  and  system  theoretical  properties  of 
structurally  adaptive  signal  processing  systems  with  special  attention  to  the  characterization 
of  their  dynamic,  transient  behavior.  Since  the  designation,  structurally  adaptive  system 
(SAS),  is  much  too  general  for  any  in-depth  analysis,  the  method  to  be  used  to  study  this 
type  of  system  is  to  define  a  pool  of  problems  representative  of  different  kinds  of  SAS’s  and 
start  analyzing  these  examples. 

In  order  to  make  use  of  the  powerful  mathematical  tools  that  exist  for  analyzing  linear 
time-invariant  systems,  the  scope  of  this  effort  was  limited  to  studying  the  dynamics  of 
recongiguration  for  linear  filter  structures.  The  pool  of  experimental  systems  for  this  study 
that  fell  into  the  category  of  linear  time-invariant  (LTI)  filters  included  included  the  following 
structures: 

•  Recursive  transformations:  This  structure  looks  quite  promising  for  implementing  a 
broad  class  of  SAS’s.  Advantages  include  finite  transient  time,  a  simple  initialization 
process,  and  a  simple  method  of  modification  [17,  18]. 

•  Direct  form  I  and  II:  These  structures  are  easily  initialized  by  keeping  track  of  past 
values  of  input  and  output.  On  the  other  hand,  they  are  not  well  behaved  under 
coefficient  quantization,  and  stability  is  not  assured  in  a  simple  manner  (the  Schur- 
Cohn  stability  criterion  must  be  used). 

•  Lattice  filters:  These  filters  are  formed  of  a  cascade  of  sections  having  identical  struc¬ 
ture.  As  such,  the  order  of  the  filter  is  easily  increased  or  decreased.  Also,  they  are 
well  behaved  under  coefficient  quantization  and  stability  is  easily  assured  by  checking 
the  magnitude  of  the  reflection  coefficients. 
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•  Wave  digital  filters:  These  have  the  property  that  they  are  easily  designed  according 
to  frequency  domain  specifications,  and  they  have  robust  numerical  properties  [16]. 

•  Cascade  filter  structures:  Cascades  of  second-order  sections  have  several  desirable  qual¬ 
ities.  They  are  relatively  insensitive  to  coefficient  quantization  and  stability  is  easily 
assured.  Also,  this  type  of  structure  directly  represents  pole/zero  information. 

•  Parallel  filter  structures:  As  with  cascade  structures,  parallel  combinations  of  second- 
order  sections  are  relatively  insensitive  to  coefficient  quantization  and  stability  is  easily 
assured.  In  constrast,  however,  while  pole  information  is  directly  represented  by  these 
sections,  the  locations  of  the  zeroes  are  not  as  easily  identified. 

•  Composite  structures:  These  are  larger  signal  processing  systems  composed  of  inter¬ 
connections  of  linear  subsystems. 

Clearly,  the  study  of  all  or  even  most  of  these  structures  is  well  beyond  the  scope  of  a 
twelve  month  effort,  so  the  direct  form  and  lattice  structures  were  chosen  for  investigation 
and  comparison.  Through  these,  especially  the  direct  form  structures,  it  was  possible  to 
obtain  results  that  are  generally  applicable  to  LTI  systems  that  are  subject  to  dynamic 
reconfiguration  while  running. 

Further  reduction  in  scope  was  necessary.  The  set  of  all  possible  structural  changes  that 
may  be  implemented  on  a  running  filter  is  enormous.  Therefore,  the  simplest  possible  change, 
i.e.,  merely  changing  the  parameters  of  a  running  '  Iter,  was  chosen  for  the  initial  study.  Such 
a  small  change  requires  only  that  the  parameters  (or  coefficients)  of  the  filter  be  changed 
without  actually  changing  the  processing  structure  of  the  filter.  Strictly  speaking  this  is 
not  a  true  structural  adaptation,  since  the  processing  structure  does  not  change.  However, 
this  simple  modification  proved  to  require  dynamic  structural  modification  in  order  to  deal 
with  a  potentially  very  undesirable  transient  response.  Thus,  even  the  simplest  modification 
proved  the  need  for  the  ability  to  implement  dynamic  structural  modifications.  The  analysis 
performed  in  this  effort  focused  on  characterizing  the  transient  response,  predicting  when  it 
would  be  unacceptably  large,  and  exploring  methods  for  dealing  with  the  transient. 

4  Research  Performed 

The  work  reported  in  the  section  is  described  in  greater  detail  in  [20]  and  [21]. 

4.1  Transient  Response  of  a  Pole-Zero  System 

To  determine  the  conditions  under  which  undesirably  large  transient  behavior  can  occur  at 
the  output  of  a  system,  and  what  steps  if  any  can  be  taken  to  correct  these  conditions,  it 
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is  helpful  to  perform  a  mathematical  analysis  of  the  system.  Practical  digital  filters  can  be 
represented  in  the  Z-domain  as  a  ratio  of  two  polynomials  in  z,  thus  having  a  finite  number  of 
poles  and  zeros.  In  this  effort,  a  digital  filter  experiencing  an  abrupt  change  in  its  coefficients 
was  considered  to  be  a  pole-zero  system  with  non- zero  initial  conditions.  Thus  a  dynamically 
reconfigurable  digital  filter  will  have  a  non- zero  state  after  the  initial  point  of  ^configuration, 
i.e.,  the  new  filter  will  have  an  initial  state  that  corresponds  to  the  final  state  (and  thus  relates 
to  the  frequency  characteristics)  of  the  old  filter.  Therefore,  if  the  coefficients  are  abruptly 
changed,  compatibility  with  the  state  of  the  old  filter  is  an  important  issue. 

The  following  equations  describe  the  Z-domain  response  of  the  filter  after  the  point  of 
reconfiguration.  It  is  assumed  that  an  abrupt  change  of  the  coefficients  occurred  just  prior 
to  time  n  =  0.  Let  Y'*'{z)  be  the  one-sided  Z-transform  of  the  nonnegative  time  output 
signal  y(n),  X'^{z)  be  the  one-sided  Z-transform  of  the  nonnegative  time  input  signal  i(n), 
the  aie's  be  the  denominator  coefficients  of  the  new  filter,  and  the  b^'s  be  the  new  numerator 
coefficients.  If  the  new  filter  obeys  the  following  difference  equation  (for  n  >  0) 

N  M 

yi^)  =  -^o.kyin-k)  -}-  ^bkx{n-k)  (1) 

k=i  i=o 

then  the  one-sided  Z-transform  yields  [22], 


Simplifying,  it  is  straightforward  to  show  that 
y+(z)  : 


A{z)  ^  A{z)  A{z) 

.  ^0y(^) 

AiT)^ 


=  Hiz)X+iz)  +  YoAz)  +  Yoy{z). 


(3) 


where  ^4(2)  =1-1-  <‘'kz~^  and  B{z)  =  J2^Q^kZ~'’ ■  H{z)  is  the  new  transfer  function  of 

the  reconfigured  filter.  Yox{z)  and  Yoy{z)  are  the  transient  responses  due  to  the  states  from 
the  past  inputs  and  past  outputs  of  the  system,  respectively. 

It  can  be  shown  that  the  term  Noy(z)  depends  on  the  initial  states  of  the  output  and  the 
denominator  of  H{z).  The  numerator  of  yoy(^),  the  term  derived  from  the  initial  states  of 
the  output  at  time  n  =  0,  can  be  decomposed  as  follows: 

N  k 

^oy(2)  =  -'^(^kZ~‘‘C£,y{-n)z'^) 

fc=l  n=l 

=  -[«;2(0)-}-u;2(1)2“‘ +----h«^2(^-2)z-<^-2)-t-u;2(iV- (4) 

=  -W-i{z) 
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where  the  u;2(n)  terms  may  be  computed  in  the  following  way.  Let  yo(»i)  =  !/( +  0 

(where  i(n)  is  the  unit  impulse),  a(n)  =  -  i),  and  W2(n)  =  a(n)  *  yo(n),  then, 

W2(n)  may  be  found  from 

W2in)  =  W2(n)u{n)  (5) 

where  u(n)  is  the  unit  step  function. 


4.2  Analysis  of  the  Transient 


The  analysis  of  the  transient  response  of  the  system  will  be  based  on  the  preceding  decom¬ 
positions  of  the  system  components.  From  Equation  (3),  we  see  that  the  response  of  a  filter, 
after  the  point  of  abrupt  coefficient  change,  is  due  to  three  components.  The  first  term, 
H{z)X'^{z),  is  simply  the  response  of  the  new  filter  to  the  new  input  data.  Provided  the 
new  filter  is  stable,  this  term  should  not  produce  an  undesirably  large  transient.  The  other 
two  components,  Yqx^z)  and  YQy(z),  are  the  responses  resulting  from  the  past  states  of  the 
input  and  jutput  of  the  old  filter,  respectively.  These  final  states  could  be  set  to  zero  after 
a  change  of  coefficients,  but  this  would  cause  a  discontinuity  in  the  system  response,  which 
may  or  may  not  be  acceptable  The  past  state  of  the  input  is  not  likely  to  cause  a  large 
transient  [20].  The  coe^  i  it  change  does,  however,  cause  the  recursive  section  of  the  new 
filter  to  be  driven  b"'  t’. ;  past  output  states  of  the  old  filter.  The  past  states  of  the  output 
are  the  new  filter’s  link  to  the  old  filter  characteristics,  and  incompatibilities  between  the  old 
and  new  filters  will  be  manifested  in  the  term  YQy{z). 

Another  perspective  can  be  gained  from  looking  at  the  numerator  of  loy(^)  in  terms  of 
the  filtering  operations  mentioned  above.  Let  yb(2),  A{z),  W2{z),  W2{z),  and  U{z)  be  the  Z- 
transforms  of  yo(n)»  a(n)i  wizC”))  W2{n),  and  «(n),  respectively.  Then  from  the  decomposition 
of  YQy{z),  it  can  be  seen  that 


'  A[z)  A(z)  ’ 

which  by  partial  fraction  expansion  can  be  transformed  into  (assuming  simple  poles) 


(6) 


N 


N 


=  E  ^  =  E 

i=l  ^  V'  ,=  1  ^ 

The  partial  fraction  expansion  coeffic  mts  can  be  written  as 

^  _  ^2(^)(1  -  Pi2~^) 


A(z) 


(7) 


(8) 


z=p, 


In  order  to  consider  the  effects  these  coefficients  have  on  the  transient  respoi  ie,  Equa 
tion  (8)  may  be  considered  as  the  product  of  two  terms: 

1  - 


A,= 


Aiz) 


im(z)Up.- 


(9) 


Z=p, 
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The  numerator  of  the  first  term  cancels  the  root  of  A(z),  leaving  an  all-pole  expression. 
Since  the  expression  is  evaluated  at  2  =  pi,  the  first  term  will  grow  large  if  any  of  the  other 
poles  lie  close  to  the  pole,  or  if  the  filter  is  of  high  order  and  several  poles  lie  relatively 
near  the  pole.  Either  of  these  two  situations  can  lead  to  an  undesirably  large  transient 
behavior.  The  magnitude  of  .4,  will  also  be  affected  by  the  magnitude  of  the  second  term 
in  Equation  (9).  If  the  second  term  is  small  enough,  it  may  cancel  a  large  value  in  the  first 
term.  Otherwise,  a  large  transient  may  result.  Now  we  consider  the  magnitude  of  W2{pi). 

Since,  w^in)  equals  the  product  of  tU2(n)  (=  a{n)  *  yo(”))  «(n),  the  second  term 

in  the  expression  for  4,  can  be  evaluated  via  complex  convolution  of  Z-transforms.  Taking 
Pi  =  re^'^  and  evaluating  the  convolution  integral  on  a  circle  having  a  radius  equal  to  that  of 
Pi  results  in 

W^ipi)  =  ^  r  Uie^^^-^^)Aire^^)Yo(re^^)de.  (10) 

ZTT  j  — TT 

However,  a  simplification  results  from  examining  and 

4(re-'^)  =  (1  -  . .  .(1  -  . .  .(1  -  (11) 


Multiplying  and  A(r€->^),  we  see  that  the  term  of  A(re^^)  combines  with 

in  the  following  manner: 


1  _ 
I  _ 


(12) 


Thus,  the  product  simplifies  to 


[f(ej(^-«))4(re-'«)  =  JJ(i  _  p^r-^e-^'’)  =  (13) 

which  results  in 

W^ipi)  r  -€^^*-^^Q{re^^)Yo{re^^)d9.  (14) 

J^ir 

For  each  pole  of  the  filter,  VF2(p,)  is  a  factor  of  the  partial  fraction  expansion  coefficient. 
Since  the  transient  magnitude  is  directly  proportional  to  the  coefficient,  it  is  also  directly 
proportional  to  W2{pi)-  The  magnitude  of  this  integral  is  dependent  on  the  relative  energies 
of  the  two  terms.  It  is  desirable  to  have  the  energy  or  “high  regions”  of  Yo{re^^)  fall  in  the 
vicinity  of  zeros  of  Q(re-’^),  which  are  at  the  locations  of  the  poles  of  the  filter  (roots  of 
4(re-'®)).  Such  an  overlap  will  reduce  the  magnitude  of  the  transient.  Energy  in  Yo{re^^) 
that  is  amplified  by  Q{re^^)  will  tend  to  increase  the  size  of  the  transient. 

These  points  are  demonstrated  in  Example  I  which  examines  the  components  found  in 
the  convolution  integral  for  two  situations. 
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Example  1:  A  filter  is  redesigned  at  time  zero  to  a  fourth-order  narrowband  bandpass 
filter  (with  closely  spaced  poles  at  0.1  ±  jO.96  and  0.0999  ±  jO.9599)  driven  only  by  past 
outputs  of  an  unknown  filter.  The  main  interest  here  is  the  location  of  the  energy  contained 
in  the  spectrum  of  |yo(re-’®)|.  We  will  consider  the  cases  in  which  the  dominant  energy  of 
|Vb(re-'^)|  is  contained  essentially  near  the  ith  root  of  \A{re^^)\  (Case  1)  and  away  from  the 
ith  root  of  |A(re''®)|  (Case  2).  We  can  see  in  Figure  I  the  plots  of  |Q(re-’®)|  and  |Fo(reJ®)|  for 
Case  1.  Note  that  the  relative  maxima  of  |Q(reJ^)|  match  up  with  the  relative  minima  of 
|yo(re''®)|,  and  vice  versa,  disallowing  any  significant  amplification  of  energy.  The  resulting 
transient  response  of  the  filter  is  shown  in  Figure  2.  Figure  3  shows  the  plots  of  \Q{re^^)\ 
and  |yo(reJ®)|  for  Case  2.  In  this  case,  the  maxima  of  the  two  terms  are  closely  aligned.  The 
transient  response  of  the  filter  for  Case  2  is  shown  in  Figure  4.  This  transient  is  much  larger 
than  that  of  Case  1. 

4.3  Some  Possible  Reconfiguration  Methods 

Since  reconfiguring  (or  changing  the  coefficients  of)  a  digital  filter  by  abruptly  changing  the 
filter  coefficients  can  produce  a  large  transient  response,  certain  situations  call  for  other  more 
gradual  approaches  to  the  reconfiguration  problem.  This  section  considers  other  methods  for 
changing  the  filter  coefficients.  The  methods  examined  come  in  two  basic  forms:  interpolation 
methods  (whi  ncrementaUy  change  the  coefficients  from  the  old  filter  into  those  of  the  new 
filter)  and  dynamic  parallel  implementation  methods  (which  switch  filters  by  running  the 
new  filter  in  parallel  with  the  old  filter  for  a  time). 

4.3.1  Interpolation  Methods 

Possibly  the  simplest  choice  for  interpolating  between  filters  is  a  linear  interpolation  of  the  di¬ 
rect  form  filter  coefficients.  For  a  filter  with  a  transfer  function  of  the  form  H{z)  =  B{z)fA{z) 
we  define  a  time  period,  T,  over  which  the  coefficient  change  will  occur.  If  the  change  begins 
at  time  n  =  uq,  then  the  end  of  the  change  period  will  be  n  =  tiq  -f  T.  Thus,  the  filter 
coefficients  will  be  interpolated  linearly  according  to 

An{z)  =  "  Anew{z)  +  ^  - -Aold{z)  ,  Oq  <  n  <  Hq  -}-  T.  (15) 

The  coefficients  of  B{z)  are  interpolated  in  the  same  manner.  One  potential  problem  is  that 
as  the  coefficients  of  A(z)  are  interpolated,  the  corresponding  poles  cannot  be  guaranteed  to 
remain  within  the  unit  circle,  thus  stability  cannot  be  guaranteed  [20]. 

The  problem  of  instability  can  be  eliminated  by  interpolating  between  the  pole  locations 
of  the  old  and  new  stable  filters  in  a  manner  similar  to  the  previous  interpolation  method. 
The  numerator  can  be  changed  by  interpolating  filter  zeros  in  the  same  fashion  or  by  using 
the  coefficient  interpolation  described  above.  While  this  method  prevents  the  system  from 
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becoming  unstable  (provided  the  initial  and  final  filters  are  stable),  there  may  be  another 
factor  contributing  to  unacceptable  transient  behavior.  Pole-zero  systems  exhibit  a  sensi¬ 
tivity  to  shifts  in  pole  locations  which  can  cause  the  frequency  response  of  the  filter  being 
interpolated  to  develop  a  high  gain  or  otherwise  become  unacceptable  in  such  a  way  as  to 
produce  an  undesirable  transient  [20]. 

Another  method  for  trying  to  preserve  stability  is  the  interpolation  of  lattice  parameters. 
Instability  should  not  be  a  factor,  since  a  lattice  structure  with  reflection  coefficients  having 
magnitude  less  than  unity  will  always  be  stable.  Therefore,  given  initial  and  final  filters  that 
are  stable,  all  intermediate  filters  will  be  stable.  However,  unacceptable  transient  behavior 
can  still  exist  because  of  a  sensitivity  to  shifts  in  pole  locations  [20].  In  summary,  these  three 
interpolation  methods  were  examined  via  experiments  in  [20]  and  found  to  give  unacceptably 
large  transients. 

4.3.2  A  Dynamic  Parallel  Implementation  Method 

Clearly,  implementing  a  change  of  the  coefficients  of  a  filter  by  directly  modifying  the  coeffi¬ 
cients,  either  abruptly  or  gradually,  can  produce  undesirable  transient  behavior.  It  may  be 
possible  to  determine  a  sequence  of  filter  coefficients  that  will  not  produce  an  undesirable 
transient,  but  finding  that  sequence  is  not  a  trivial  task.  A  simpler  and  more  predictable 
technique  for  achieving  a  well-behaved  change  of  the  filter  coefficients  proceeds  as  follows. 
Build  a  new  filter  in  parallel  with  the  old  one,  forcing  the  new  filter’s  initial  states  to  be 
all  zeros,  then  perform  a  time-varying  weighted  sum  of  the  outputs  of  the  two  filters.  The 
time-varying  weighted  sum  consists  of  linearly  changing  the  values  of  the  amplifiers  at  the 
inputs  to  a  summer  over  the  switching  time,  T.  The  weight  associated  with  the  old  filter  will 
change  linearly  from  one  to  zero,  and  that  of  the  new  filter  will  change  linearly  from  zero  to 
one.  The  output  of  the  filter,  y(n),  during  the  switching  process  is 

y{n)  =  A:i(n)[/io;,<(n)  ♦  i(n)]  -f  fc2(n)[hneu/(n)  *  i(n)]  (16) 

where  k\  =  and  ^2  =  for  no  <  n  <  tiq  -f  T.  Assuming  both  filters  are 

properly  behaved,  no  undesirable  transient  will  be  produced.  The  implementation  form  for 
this  method  is  shown  in  Figure  5.  Figure  5(a)  represents  the  original  filter  prior  to  change. 
Figure  5(b)  shows  the  multipliers  that  are  varied  during  the  tapered  switch.  Figure  5(c) 
is  simply  the  new  filter  after  switching.  A  very  similar  dynamic  parallel  implementation 
method,  the  overlapping  sum  method,  is  given  in  [20].  Example  2  shows  a  tapered  switch 
applied  to  a  bandpass  filter. 

Example  2:  A  linearly  tapered  switch  implementation  of  a  bandpass  filter  redesign  has  been 
performed  over  the  time  interval  n  =  301  to  n  =  400.  The  transient  response  produced  is 
well-behaved  as  shown  in  Figure  6. 
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4.4  Determination  of  Transient  Size 


The  purpose  of  determining  the  size  of  the  transient  prior  to  implementation  of  the  filter 
change  is  to  determine  which  type  of  reconfiguration  to  implement.  If  transient  behavior 
never  becomes  undesirably  large,  an  abrupt  coefficient  change  may  be  perfectly  acceptable. 
However,  if  it  can  be  determined  that  the  transient  will  grow  large,  then  some  other  type  of 
change  technique  will  be  necessary. 

4.4.1  Exact  Calculation  of  Transient 

Section  4.2  showed  that  given  stable  old  and  new  filters  and  bounded  input,  any  undesirably 
large  transient  behavior  occurring  at  the  time  of  filter  change  will  be  largely  caused  by 
the  incompatibility  between  the  initial  output  st-ate  (from  the  old  filter)  and  the  new  filter 
coefficients.  The  transient  due  to  the  initial  output  states  can  be  determined  by  creating  a 
filter  with  transfer  function  driving  this  filter  with  an  impulse,  and  searching  for  the 

maximum.  This  computational  method  obtains  the  output  state  transient,  and  requires  only 
knowledge  of  the  filter  coefficients  and  initial  output  state. 

4.4.2  Estimate  of  Transient 

An  estimate  of  the  maximum  transient  of  yQy{n)  can  be  obtained  by  sampling  its  Z-transform, 
Yoy{2),  N  times  around  the  unit  circle  and  then  applying  an  inverse  fast  Fourier  transform 
(IFFT). 

Voy(^)  =  ^v(^)lz=expj^fc  A:  =  0,  1,  .  .  . ,  iV  —  1 
The  transient  estimate  is  then 


yoy{n)  =  IFFT{Yoy{k)}.  (17) 

The  largest  transient  value  would  then  be  estimated  by  maxn{|  iloyin)  |}.  This  estimate 
can  be  both  fast  and  have  predictable  (fixed)  computational  cost.  However,  sampling  the 
Z-transform,  Fby(’),  produces  an  aliased  inverse  transform  that  may  require  N  to  be  large  to 
diminish  the  aliasing.  N  may  not  have  to  be  large  to  obtain  a  useable  estimate,  and  practical 
results  have  been  obtained  for  values  as  small  as  N  =  8  [20]. 

4.4.3  Transient  Bounds 

Direct  Time-Domain  Bound:  Given  that  the  filter  implemented  after  reconfiguration  is 
a  stable  filter,  all  the  poles  will  lie  inside  the  unit  circle  and,  therefore,  have  magnitude  less 
than  one.  Thus,  we  see  from  the  time-domain  transient  response  that  the  largest  value  of  the 
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transient  will  always  be  less  than  the  sum  of  the  magnitudes  of  the  partial  fraction  expansion 
coefficients,  A,.  Therefore,  one  bound  for  the  transient  response  is  given  by 

N 

max|yov(n)|  <  (18) 

Tl 

«=i 

To  obtain  this  bound,  the  roots  of  the  denominator  polynomial  of  the  filter  will  need  to 
be  determined.  The  computational  cost  of  this  operation  (if  the  roots  are  unknown)  and  the 
closeness  of  the  bound  to  the  actual  transient  maximum  are  both  unknown. 

Fourier  Transform  Bound?  Another  bound  on  the  transient  can  be  obtained  by  sampling 
the  Fourier  transform  of  yoy{n)  from  its  Z-transform,  yoy(^)»  A  times  around  the  unit  circle 
and  taking  the  inverse  Fourier  transform.  The  discrete  Fourier  transform,  yoj,(A:),  which  is 
an  estimate  of  the  Fourier  transform,  Yoy^e^'^),  will  be  given  by 

yov(fc)  =  i'ov(-^)L=exp,^fc  ,  fc  =  0,l,...,iV-l.  (19) 

Applying  the  inverse  Fourier  transform,  taking  the  absolute  value  of  both  sides,  and  applying 
the  triangle  inequality  results  in 

|yov(«)l<4E  l^0v(fc)l  (20) 

fe=0 

for  all  n.  Comparing  the  cost  of  obtaining  this  bound  with  the  costs  of  obtaining  the  time- 
domain  bound,  the  estimate,  and  the  exact  calculation  of  the  transient,  this  bound  may  be 
the  least  expensive  to  compute.  No  rooting  of  the  filter  denominator  polynomial  is  required, 
no  search  for  a  maximum  is  required,  and  no  FFT  is  needed.  The  accuracy  of  this  bound 
will,  however,  depend  on  the  number  of  samples  N,  and  in  practice  it  has  been  observed  that 

this  bound  is  not  close  to  the  actual  maximum  unless  N  is  chosen  to  be  large  (e.g.,  N  =  1024 

or  2048). 

Comparison  of  Bound  and  Estimate  Accuracy:  Bound  and  estimate  data  for  the 
transient  term  due  to  the  initial  output  state  have  been  taken  for  a  fourth-order  narrowband 
bandpass  filter  (with  poles  at  0.1  ±j0.9  and  0.0975±j0.88).  For  the  purpose  of  comparing  how 
closely  the  bounds  and  estimate  follow  the  actual  peak  value  of  the  transient,  the  direct  time- 
domain  bound  and  the  Fourier  transform  bound  will  be  referred  to  as  bound  1  and  bound2, 
respectively.  Since  the  estimate  and  bound2  depend  on  the  number  of  samples  taken  around 
the  unit  circle,  N  (which  is  a  power  of  two  to  make  use  of  the  inverse  FFT),  data  has  been 
taken  for  different  values  of  N.  Table  1  contains  the  data  for  the  fourth-order  narrowband 
bandpass  filter  for  the  case  in  which  the  past  output  frequency  spectrum  peaks  only  near 
a  pole  of  the  reconfigured  filter  (similar  to  case  1  of  Example  1).  Recall  that  this  situation 
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will  produce  a  relatively  well-behaved  transient  response.  The  column  labeled  “peak”  refers 
to  the  actual  peak  value  of  the  transient  response.  Table  2  contains  data  for  the  same  filter 
for  the  case  in  which  the  past  output  frequency  spectrum  peaks  away  from  the  poles  of  the 
reconfigured  filter  (similar  to  case  2  of  Example  1). 

Of  the  two  bounds  and  the  estimate,  the  estimate  seems  to  follow  the  actual  peak  value 
of  the  transient  the  closest  for  both  filters  under  both  sets  of  initial  output  states.  This  is 
true  even  for  relatively  small  N.  The  time-domain  bound,  which  does  not  depend  on  iV,  is 
somewhat  inconsistent  in  its  closeness  to  the  peak  value.  The  Fourier  transform  bound  seems 
to  approach  the  transient  peak  for  large  N  but  can  be  significantly  greater  than  the  transient 
peak  for  small  N. 
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Q(re>)j*theU|)  and  yo(re-}J*UieUl) 


Figute  1:  |Q(re^*)|  and  Case  1. 

5  Conclusions  and  Recommendations  for  Future  Research 

There  is  much  left  to  be  done  in  order  to  obtain  a  deeper  understanding  of  the  transient  prop¬ 
erties  of  structurally  adaptive  signal  processing  systems.  The  work  of  this  effort  addressed 
one  of  the  simplest  possible  adaptations  and  showed  that  dynamic  structural  change  was 
necessary  to  deal  with  the  transients  produced  by  this  type  of  change.  Clearly,  this  should 
give  additional  demonstration  of  the  need  for  structural  adaptation  in  dynamically  changing 
systems. 

A  lo^cal  next  step  in  the  analysis  of  transients  is  to  consider  a  linear  filter  in  which  not 
only  the  parameters  but  also  the  order  (i.e.,  the  size)  of  the  filter  changes  during  run-time. 
Actually,  the  majority  of  the  analysis  of  this  case  is  going  to  be  the  same  as  that  described 
in  this  report.  The  main  difference  lies  in  determining  what  the  initial  state  of  the  new 
filter  will  be.  This  will  be  a  function  of  whether  the  new  filter  is  larger  or  smaller  than 
the  old  filter.  Another  research  direction  that  is  somewhat  related  to  this  one  is  the  case 
where  the  parameters  and  the  method  of  implementation  of  the  filter  chsmge.  For  example, 
a  filter  implemented  as  a  direct  form  n  structure  is  changed  to  a  different  filter  with  a  lattice 
structure.  In  this  case  the  question  of  transforming  the  final  state  of  the  old  structure  into 
the  initial  state  of  the  new  structure  should  be  investigated. 

On  a  larger  scale,  the  problem  of  redesigmng  entire  sections  of  a  running  large-scale 
structurally  adaptive  system  should  be  studied.  It  is  very  possible  that  a  scheme  much  like 
the  tapered  switch  or  overlapping  sum  methods  of  this  report  will  be  applicable  to  this  larger 
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Figure  6:  Filter  output  for  tapered  switch  redesign. 


TABLE  1 

BOUND  AND  ESTIMATE  DATA  FOR  FOURTH-ORDER 
NARROWBAND  FILTER:  CASE  1. 


N 

Boundl 

Bound2 

Estimate 

Peak 

8 

1.2228 

1.6341 

1.1715 

0.9993 

16 

1.2228 

1.3073 

0.6483 

0.9993 

32 

1.2228 

1.3873 

0.5215 

0.9993 

64 

1.2228 

3.4344 

2.5015 

0.9993 

128 

1.2228 

2.5588 

1.4318 

0.9993 

256 

1.2228 

2.2716 

1.0919 

0.9993 

512 

1.2228 

2.1495 

0.9837 

0.9993 

1024 

1.2228 

2.1419 

0.99 

0.9993 

2048 

1.2228 

2.1433 

0.999. 

0.9993 
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TABLE  2 

p'^tjnd  and  estimate  data  for  fourth-order 

NARROWBAND  FILTER:  CASE  2. 


N 

Boundl 

Bound2 

Estimate 

Peak 

8 

62.70 

8914.7 

20.00 

25.68 

16 

62.70 

4455.6 

20.00 

25.68 

32 

62.70 

2228.3 

20.00 

25.68 

64 

62.70 

1151.3 

19.70 

25.68 

128 

62.70 

558.9 

20.75 

25.68 

256 

62.70 

274.8 

19.73 

25.68 

512 

62.70 

135.6 

20.03 

25.68 

1024 

62.70 

67.7 

23.30 

25.68 

2048 

62.70 

33.9 

25.70 

25.68 

problem.  The  tapered  switch  and  overlapping  sum  methods  are  not  limited  to  LTI  systems 
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ABSTRACT 


A  very  important  requirement  of  a  communication  network  is  to 
deliver  data  from  a  source  node  to  a  destination  node.  This 
report  looks  at  several  types  of  network  layouts  (topologies), 
identifies  reasonable  reliability  measures  and  algorithms  which 
calculate  these  measures,  develops  useful  analytic  tools,  and 
finds  a  shortest-path  algorithm. 
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I.  INTRODUCTION 


A  very  important  requirement  of  a  communication  network  is  to 
deliver  data  from  a  source  node  to  a  destination  nodf’  in 
practice,  communication  networks  can  be  categorized  based  on  the 
architecture  and  techniques  used  to  deliver  the  data.  For  the 
purpose  of  this  report,  the  communication  networks  can  be 
regarded  as  packet  switching  networks,  but  the  issues  that  are 
addressed  here  are  applicable  to  other  types  of  networks.  Some 
excellent  references  in  this  field  are  Rosner  [1982],  Tanenbaum 
[1981]  and  Stallings  [1985]. 

The  physical  layout  of  a  communication  network  and  the  method  of 
routing  data  in  it  are  clearly  very  closely  related.  Tanenbaum 
suggests  a  number  of  desirable  attributes  in  the  routing  function 
of  a  communication  network;  correctness,  simplicity,  robustness, 
stability,  fairness  and  optimality.  Stallings  adds  efficiency  to 
this  list.  There  is  no  way  to  optimize  routing  over  all  these 
objectives;  there  is  always  a  trade-off  involved.  However,  the 
design  of  the  network  should  allow  for  routing  which  is  "good", 
however  the  "goodness"  may  be  measured. 

This  report  looks  at  several  types  of  network  layouts 
(topologies) ,  identifies  reasonable  reliability  measures  and 
algorithms  which  calculate  these  measures,  develops  useful 
analytic  tools,  and  finds  a  shortest-path  algorithm. 
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II.  COMPLEXITY  THEORY 


A  problem  which  requires  n  bits  to  be  fully  specified  is  said  to 
be  of  size  n.  If  an  algorithm's  running  time  cannot  be  bounded 
by  a  polynomial  function  of  n,  then  the  algorithm  is  called  an 
exponential  time  algorithm  and  is  said  to  be  intractable. 

Problems  that  are  called  NP-complete  form  an  equivalence  class  in 
the  following  sense:  if  a  polynomial  time  algorithm  exists  for 
one  of  the  NP-complete  problems,  then  every  'IP-complete  problem 
has  a  polynomial  time  algorithm.  As  of  now,  no  polynomial  time 
algorithm  has  been  found  for  any  of  the  known  NP-complete 
problems.  While  many  people  believe  the  conjecture  that 
NP-complete  problems  are  intractable,  it  has  not  been  proved  and 
it  is  the  most  important  open  question  in  current  complexity 
theory.  Many  network  reliability  problems  are  are  at  least  as 
hard  to  solve  as  the  NP-complete  problems. 
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Ill .  GRAPHS 


A  graph  G  =  (V,E)  consists  of  a  vertex  set 

•  V  =  {  v(l)  ,v(2)  ,  .  .  .  ,v(N)  } 

and  an  edge  set 

^  E  =  {  e(l) ,e(2) - ,e(M)  }. 

• 

Each  element  in  E  is  a  two-element  subset  of  V.  A  directed  graph 
(called  a  digraph)  is  a  graph  where  E  consists  of  ordered  pairs 
of  v(i) 's.  A  path  in  a  graph  is  an  alternating  sequence  of 

» 

i  vertices  and  edges,  where  the  vertices  and  edges  can  be  labelled 

so  that  if  the  path  is  written 

» 

v(l)  e(l)  v(2)  ...  e(k-l)  v(k)  then 

{  v(j),  v(j+i)  }  for  a  graph 

•  (  v(j),  v(j+i)  )  for  a  digraph. 

Vertex  1  and  vertex  k  are  said  to  be  connected  if  there  is  a  path 
as  above.  A  graph  is  connected  if  all  possible  vertex  pairs  are 
»  connected. 


At  any  time 
probability 
reliability 
probability 
edge  j  . 


of  interest,  the  event  "v(i)  is  operative"  has 
denoted  p(v(i))  or  p(i);  this  probability  is  the 
of  vertex  i.  Similarly,  p(e(j))  or  p(j),  the 
that  edge  e(j)  is  operative,  is  the  reliability  of 
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This  paper  is  concerned  with  reliability  problems  in 
communication  networks,  hence  the  more  suggestive  terminology 
nodes  and  links  will  be  used  for  the  generic  terms  vertices  and 
edges.  In  addition,  the  notation  v(i)  and  e(j)  will  be 
simplified  by  suppressing  the  "v"  and  "e"  unless  necessary  for 
clarity.  Network  will  be  used  instead  of  graph,  with  the 
understanding  that  the  network  could  be  a  digraph.  The 
particular  layout  of  the  network  v/ill  be  called  the  network 
topology.  A  network  is  represented  through  the  use  of  a  picture 
in  the  usual  way;  an  example  is  Figure  1.  Note  that  the  links 
from  node  1  are  directed  while  all  other  links  are  not  directed; 
therefore  a  message  can  be  sent  from  any  node  except  4,5  and  6  to 
any  other  node  if  all  nodes  and  links  are  operative.  N,  the 
number  of  nodes  in  the  network,  equals  12  in  this  example. 


FIGURE  1.  An  example  of  a  network. 


5  0 
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One  example  of  a  network  reliability  problem  is  as  follows. 

Let  G  be  a  network  with  known  node  and  link  reliabilities.  What 
is  the  probability  that  the  network  is  connected? 

This  problem  and  most  other  network  reliability  problems  are  at 
least  as  hard  as  the  NP-complete  problems.  Hence  the  existance 
of  an  algorithm  which  solves  this  problem  in  polynomial  time 
would  imply  the  existence  of  polynomial  time  algorithms  for  all 
NP-complete  problems. 

This  is  the  fundamental  problem  with  virtually  all  network 
reliability  problems;  no  one  has  found  polynomial  time  algorithms 
which  solve  them,  and  furthermore  most  researchers  feel  that 
these  problems  are  intractable. 
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IV.  A  TRACTABLE  NETWORK  RELIABILITY  PROBLEM 


The  following  is  given: 

1)  the  network  topology 

2)  the  reliability  of  all  nodes  and  links. 

The  objective  is  to  route  messages  by  the  most  reliable  path. 

Any  network  with  node  and  link  failures  are  can  be  transformed 
into  a  network  with  only  link  failures.  Replace  every  node  i  in 
the  original  network  with  two  nodes  i^  and  i^;  the  reliability 
of  the  original  node  is  assigned  to  the  link  connecting  the  nodes 
in  the  new  network.  Thus,  if  p(v(i))  =  p  in  the  original  network 
then  in  the  new  network  the  probabi"  ty  of  the  link  between  i^ 
and  being  operative  is  is  p ( (v ( i ,  v ( i^ )  )  =  p.  An  example 
follows . 

FIGURE  2.  Replacement  of  node  reliability  by  link  reliability 


/ 

/ 

/ 

>  O - >  becomes 

node  i 


/ 

/ 

/ 

>  0 - >  0 - > 

node  i^  node  i^ 


20-8 


Thus,  without  loss  of  generality,  we  can  make  the  very  useful 
assumption  that  any  network  has  only  link  failures.  Suppose 
there  exists  such  a  network.  Then 

let  W(s,t)  =  {  w(s,t)  I  w(s,t)  is  a  path  from  node  s  to  node  t  ). 

Let  R(s,t)  =  max  {  P(e(j))  }. 

all  e(j)  in 
w(s,t)  in  W(s,t) 

R(s,t)  is  the  reliability  of  the  most  reliable  path  from  node  s 
to  node  t;  i.e.  the  path  with  the  largest  product  link 
probabilities  (a  subset  of  which  were  node  probabilities  in  the 
original  network).  The  most  reliable  path  is  denoted  w*(s,t). 

Let  d(e(j))  =  -  In  (  p(e(j))  )  and  D(s,t)  =  -  In  (  R(s,t)  ). 

Then 

D(s,t)  =  min  {  d(e(j))  } 

all  e(j)  in 
w (s , t)  in  W (s , t) 

is  the  "shortest”  route  from  s  to  t  in  the  network  with  distances 
given  by  the  function  d  above. 

This  is  the  well  known  shortest  route  problem  and  can  be  solved 
very  efficiently  with  existing  algorithms. 

The  algorithm  presented  below  is  a  modification  of  Dijkstra's 
label  setting  algorithm  for  finding  the  shortest  route  between 
nodes  in  a  network.  For  large  sparse  matrices,  the  Bellman-Moore 
label  correcting  algorithm  as  improved  by  d'Esopo  and  coded  by 
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Pape  [1980]  appears  to  be  faster.  Recent  work  by  Glover  [1984] 
indicates  that  THRESH,  a  hybrid  of  label  setting  and  label 
correcting  algorithms,  is  the  currently  fastest  available 
algorithm  in  general.  An  excellent  reference  fc  this  area  is 
Syslo  [1983];  it  includes  all  but  the  latest  work  Ly  Glover. 

Assume  now  that  there  exists  a  directed  network  with  only  node 
failures.  While  this  assumption  is  made  primarly  for  ease  of 
exposition,  it  is  not  necessarily  a  bad  assumption  for  a  model  of 
a  communications  network  based  in  space.  The  probability  of  node 
i  being  operative  is  denoted  p(i).  The  nodes  are  labelled 
1 , 2  ,  .  .  .  ,  N . 

ALGORITHM 

/  0  if  there  is  no  link  from  i  to  j 

Let  u(i,j)  =  < 

\_  P(j)  if  there  is  a  link  from  i  to  j 

and  T  =  N  (the  set  of  nodes) . 

1)  Set  r(s)  =  1  and  r(j)  =  u(s,j)  for  j  >  1  where  j  is  an  element 
of  T  =  N  -  (s).  The  set  T  is  labelled  the  set  of  temporary 
nodes . 
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2)  Find  i  in  T  such  that 

r(i)  =  max  {  r(j)  i  j  is  in  T  ). 

3)  Set  T  < -  T  -  (i}.  If  T  is  the  empty  set,  then  stop; 

else  go  to  step  4.  Node  i  is  labelled  permanent. 

4)  For  each  j  in  T,  set 

r(j)  < -  max  {  r(j)  ,  r(i)u(i,j)  } 


5)  Go  to  Step  2. 

Optimal  routes  are  generated  by  recording  the  nodes  which  solve 
the  maximization  problem  in  Step  4)  above.  The  time  complexity 

3 

of  this  algorithm  is  0(N  ),  which  includes  finding  the  most 
reliable  path  from  every  node  to  every  other  node. 
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V.  RELIABILITY  MEASURES  AND  ALGORITHMS 

Network  reliability  analysis  is  very  difficult  for  a  number  of 
reasons.  First  of  all,  it  is  not  easy  to  even  define  what  the 
problem  is.  Secondly,  all  of  the  measures  presented  below  result 
in  problems  that  are  at  least  as  hard  as  the  NP-complete 
problems.  In  addition,  solution  algorithms  do  not  lend 
themselves  to  optimization  of  network  design,  but  rather  to  a 
description  of  proposed  topologies.  In  practice,  a  combination 
of  analysis  and  simulation  is  used  to  try  to  produce  one  network 
design.  In  Sections  VII  and  VIII,  some  useful  analytic  tools  are 
developed. 

There  is  a  large  growing  literature  in  network  reliability.  One 
or  more  of  the  following  reliability  measures  have  appeared  in 
many  recent  papers.  (See  Bibliography  section  on  reliability.) 

1)  the  probability  that  all  nodes  are  communicating 

2)  the  probability  that  all  operative  nodes  are  communicating 

3)  the  probability  that  all  operative  nodes  are  communicating 

with  a  given  node 

4)  the  expected  number  of  nodes  communicating 

5)  the  expected  number  of  nodes  communicating  with  a  given  node 

6)  the  expected  number  of  node  pairs  communicating  with  a  given 
node 

7)  the  probability  that  the  system  operates 
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Two  of  the  best  papers  are  Ball  [1979]  and  Ball  and  Nemhauser 
[1979].  Ball  [1979]  specialized  algorithms  v/hich  calculate  the 
first  six  measures  for  networks  in  which  only  nodes  can  fail. 

The  foundation  of  these  and  virtually  all  other  reliability 
algorithms  is  clever  partitioning  of  the  sample  space  and 
appropriate  conditioning  of  events. 

The  measure  addressed  here  is  1) ,  the  probability  that  all  nodes 
are  communicating.  With  only  link  failures,  this  is  also  the 
probability  that  the  network  is  connected. 
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VI.  TOPOLOGIES 


Loop  topologies  have  received  a  fair  amount  of  attention  in 
recent  years.  There  are  a  series  of  papers  which  deal  with  loop 
topologies  set  up  without  a  central  control  node.  This  enables 
routing  schemes  to  be  set  up  which  are  adaptive  to  changes 
(additions  or  deletions)  in  the  network.  Loop  topologies  can  be 
used  in  networks  ranging  from  local  area  networks  to  world-v/ide 
satellite  communication  networks. 

Saltzer  [1981]  found  some  good  reasons  for  using  a  ring  (or  loop) 
network  in  local  area  networks.  Raghavendra  [1981]  proved  that 
the  optimal  loop  topology  should  have  forward  short  links  to  the 
next  node  and  backward  long  links  to  the  node  1_  SQRT  (N)  _1  nodes 
away.  Optimal  here  means  that  this  topology  minimizes  the 
maximum  distance  between  nodes.  Claims  are  made 
for  optimality  in  terms  of  reliability,  but  the  reliability 
measure  used  is  very  primitive. 

Brayer  [1984]  took  this  result  and  a  routing  algorithm  from 
Chyung  [1975]  to  develop  an  algorithm  which  automatically  updates 
routing  v;hen  there  are  either  additions  to  or  deletions  from  the 
network.  Raghavendra  [1985]  studied  the  performance  of  the 
double  loop  network  topology  using  a  variety  of  adaptive  routing 
algorithms.  He  compared  utilization  and  delay  figures  derived 
from  a  simple  queueing  model  to  estimated  values  based  on  a 
series  of  simulations. 
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In  what  follows,  loop  topologies  follow  the  optimal  layout,  but 
with  no  directed  links.  (If  the  nodes  are  drawn  in  a  circle  and 
connected  as  below,  it  is  clear  why  the  term  loop  topology  is 
used.)  A  city  topology  is  one  which  resembles  the  classic  street 
structure  of  a  number  of  cities.  Note  also  that  a  city  topology 
can  be  regarded  as  a  subset  of  a  loop  topology,  where  the 
boundary  nodes  do  not  have  the  full  connection  of  a  loop 
topology.  The  distance  between  two  nodes  along  a  given  path  is 
measured  by  the  number  of  links  in  the  path,  in  both  topologies. 
The  topologies  do  not  necessarily  have  to  be  square. 
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city  Topology 

The  graph  below  is  described  as  a  5  by  5  city  topology.  Rows 


and  columns  refer 
nodes  1  through  5 
column  is  1,6,11, 

to  the  obvious:  the  first  row 
,  the  second  6  through  10  and 
16,21. 
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Loop  Topology 

The  graph  below  is  described  as  a  5  by  5  loop  topology. 
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VII .  AN  ANALYTIC  APPROXIMATION 


There  are  N  nodes  in  a  connected  network;  at  node  i  there  are 
m(i)  outbound  links  and  link  (i,j)  has  reliability  p(i,j).  Then 
the  network  reliability,  the  probability  that  the  network  is 
connected  (P^)  is  approximately: 

N  m(i) 

I  I 

II  (  1  -  II  (  1  -  P(i,j))  ) 

II  II 

i=l  j=l 

This  approximation  is  based  on  defining  non-uniform  Bernoulli 
trials  at  every  node,  where  a  trial  is  "observing  an  outbound 
link"  and  a  success  is  "the  link  working." 

This  approximation  was  compared  to  actual  values  calculated  using 
an  algorithm  in  Ball  [1979],  The  code  for  the  algorithm  was 
graciously  supplied  by  Dr.  Ball.  Note  that  the  probability  is 
reliability  measure  1)  of  Section  V. 

Some  typical  results  are  illustrated  by  the  following  graphs. 

The  vertical  axis  is  P  and  the  horizontal  axis  is  the  single 

c 

link  (and  node,  when  applicable)  reliability,  assumed  to  be  the 
same  for  all  links  (and  nodes,  when  applicable).  A  number  of 
loop  and  city  topologies  were  considered.  The  legends 
"Predicted"  and  "Actual"  refer  to  the  approximation  and  the  Ball 
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algorithm  respectively.  Going  beyond  the  size  5  by  5  was 
impractical  for  the  Ball  algorithm;  this  is  due  to  the  problem 
complexity  discussed  in  Section  V,  The  approximation  does  quite 
well  for  this  class  of  topologies,  and  even  when  the 
approximation  is  at  its  worst,  it  still  finds  the  relatively  flat 
part  of  each  curve  where  reliability  values  are  close  to  one. 
This  is  useful  when  designing  such  networks,  as  it  provides  a 
range  of  link  and  node  reliabilties  that  result  in  essentially 
equivalent  netv;ork  reliability. 
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VIII.  NETWORK  DESIGN 


One  network  design  problem  can  be  stated  as  follows. 

Given  a  limit  to  the  number  of  links  which  can  come  out 
of  any  node,  what  is  the  topology  which  maximizes  the 
probability  that  the  network  is  connected? 


This  problem  is  formulated  as  PROBLEM  I. 


PROBLEM  I 

Let 

i  and  j  be  nodes,  i  not  the  same  as  j  throughout 


/  1  if  there  is  a  link  from  i  to  j 

x(i,j)  =  < 

\_  0  if  there  is  no  link 

P(i/j)  =  probability  that  link  i,j  is  up 


Then 


max 


N 


N 


I  I 


I  (  1  -  P(i/ j ) ) 


x(i, j) 


i=l 


j  =  l 


N 

subject  to  / _  x(i,j)  <=  a(i) 

j  =  l 


X  ( i , j )  =  0  or  1 
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The  objective  function  is  separable,  which  allows  for  easy 
analysis.  Also,  a  number  of  the  results  from  Kolitz  [1987]  can 
be  applied  to  this  problem;  conclusions  follow. 

1)  Use  a  maximum  marginal  return  algorithm,  essentially  Algorithm  I 
in  Kolitz  [1987],  to  optimally  assign  links. 

2)  If  p(i,j)  is  constant  for  all  i  and  j,  then  the  number  of 
links  from  any  node  should  differ  by  no  more  than  one  from  the 
number  of  links  from  any  other  node. 

3)  The  optimal  design  equalizes  reliability  at  each  node;  that 
is,  the  probability  of  at  least  one  link  being  up  at  each  node  is 
the  same. 


Another  network  design  problem  can  be  stated  as  follows. 

Given  a  minimum  necessary  level  of  reliability  for  a 
network,  what  is  the  topology  which  minimizes  the  cost  of 
building  the  network? 

This  problem  is  formulated  as  PROBLEM  II.  Problem  II  is 
currently  being  studied. 
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PROBLEM  II 


Let 

i  and  j  be  nodes,  i  not  the  same  as  j  throughout 
/  1  if  there  is  a  link  from  i  to  j 

x{i,j)  =  < 

\_  0  if  there  is  no  link 

P(i/j)  =  probability  that  link  i,j  is  up 

c(i,j)  =  cost  of  link  i,j 

R  =  the  required  level  of  network  reliability 
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IX.  SATELLITE  NETWORK  ARCHITECTURES 


Network  reliability  (P  )  was  calculated  for  two  satellite  network 
architectures.  In  both  architectures,  the  orbital  height  is  2100 
kilometers,  the  satellites  are  in  polar  orbit  and  there  arc  two 
orbital  planes  (rings) .  Also,  spacing  is  such  that  there  is  a 
constant  time  between  satellites  passing  over  the  North  Pole. 

One  architecture  has  six  satellites  per  ring  and  is  denoted  2  by 
6  or  2x6;  the  other  has  nine  satellites  per  ring  and  is  denoted  2 
by  9  or  2x9.  Network  topology  is  determined  by  the  geometry. 
Links  are  present  whenever  two  satellites  are  in  line  of  sight, 
subject  to  the  line  being  a  given  minimum  distance  above  the 
earth's  surface.  Three  minimum  distances  are  used:  0,  100  and 
700  kilometers. 

Because  of  the  geometry,  patterns  repeat  in  cycles  throughout  a 
complete  orbit  of  any  one  satellite.  The  horizontal  axis  is 
time,  expressed  as  the  proportion  of  a  cycle;  a  complete  orbit's 
graph  would  consist  of  repetitions  of  the  graphs  presented.  For 
display  purposes,  the  points  are  connected  by  straight  lines  to 
emphasize  the  changes  in  P  as  a  function  of  time.  The  legends 
0.99,  0.95,  etc.  are  the  single  link  reliability  values. 
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X.  ROUTING 


The  following  algorithm  is  useful  for  making  routing  decisions 
when  links  and  nodes  are  subject  to  failure  and  all  link 
distances  are  one.  It  is  very  efficient  when  the  network  has  the 
city  topology 


ALGORITHM 


I.  Find  all  shortest  length  paths  from  s  to  t  for  all  nodes  s 
and  t. 

In  the  case  of  the  the  city  topology,  these  paths  are  trivial 
to  compute. 


II.  Keep  track  of  how  many  paths  there  are  for  each  pair,  and  index 
each  path  by  node  and  link. 

In  the  case  of  the  city  topology,  the  number  of  shortest 
paths  can  be  computed  at  the  start  using  the  following 
formula : 


20-35 


number  of  shortest  paths  from  i  to  j  =  number  of 

combinations  of  A  things  taken  B  at  a  time  where 

A  =  distance  between  i  and  j 

=  |row(i)  -  row(j)|  +  |col(i)  -  col(j)| 

B  =  minimum  (  Irow(i)  -  row(j) |  ,  |col(i)  -  col(j) |  ) 


III. Let  N(t)  =  (i:  d(i,t)  =  1};  N(t)  is  the  set  of  nodes  one  link 
away  from  t. 

When  a  node  or  link  goes  down,  delete  all  paths  which  contain 
it  and  update  the  number  of  shortest  length  paths  for  every 
node  pair.  If  no  shortest  length  paths  are  left,  then  form  a 
new  set  of  shortest  length  paths  by  concatenating  the 
destination  node  to  the  shortest  length  paths  from  s  to  N(t). 

Go  to  II. 


The  advantage  of  this  algorithm  over  the  standard  algorithms, 
which  are  very  fast  for  the  special  structure  here,  is  that  all 
feasible  shortest  paths  are  always  known.  Thus,  routing  can  be 
done  using  other  criteria,  such  as  equalization  of  traffic 
density,  over  different  shortest  paths.  The  problem  is  changed 
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from  one  of  computation,  using  the  classical  methods,  to  one  of 
accessing  a  (potentially  large)  database.  The  algorithm  was 
coded  and  tested  on  city  topologies,  with  good  results. 
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ABSTRACT 


As  electronic  systems  become  more  complex,  testability  is 
becoming  an  increasingly  important  design  consideration.  To 
incorporate  testability  properly  during  the  design  phase,  an 
appropriate  testability  analysis  tool  must  be  selected.  In  this 
report,  we  address  the  necessity  for  including  testability  in  the 
design  phase  and  develop  a  set  of  criteria  to  compare  testability 
analysis  tools.  We  analyze  several  tools  with  respect  to  the 
established  criteria.  Finally,  we  conclude  with  the  summary  and  the 
recommendations  regarding  the  selection  of  a  suitable  testability 
analysis  tool  for  design  for  testability  (DFT) . 
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I.  INTRODUCTION 


Recent  USAF  programs  have  sought  to  address  the 
enhancement  of  both  built-in  and  off-line  testing  of  new 
systems  early  in  the  acquisition  phase.  This  requir  s  the  inclusion 
of  testability  concepts  in  the  design  phase.  It  has  the  goal  of 
eliminating  interim  contractor  support  (ICS)  and  thus  reducing  the 
life-cycle  cost  (LCC)  of  the  system.  In  addition  to  reducing  the 
LCC  of  the  system,  the  incorporation  of  testability  in  the  design 
phase  also  increases  the  reliability,  maintainability,  and 
availability  (RM&A)  of  the  system.  The  potential  monetary  savings 
and  the  increased  quality  of  the  system  make  the  inclusion  of 
testability  in  the  design  phase  very  attractive.  Testability  must 
be  included  at  the  lowest  level  of  design  to  be  effective.  For 
electronic  systems,  this  means  that  the  testability  criteria  should 
be  applied  at  the  circuit  card  assembly  (CCA)  level.  This  is 
becoming  increasingly  more  important  with  the  increased  CCA 
complexity  of  modern  electronic  systems.  The  task  of  including 
testability  in  the  design  phase  of  the  CCAs  lends  itself  to 
computer-aided  design  (CAD)  tools.  To  ensure  an  accurate 
verification  of  testability  requirements,  a  proper  CAD  tool  must  be 
chosen  that  is  suitable  for  the  job  at  hand.  The  previous  studies 
of  testability  analysis  tools  have  dealt  mainly  with  system  level 
analysis,  but  not  with  the  board  level  analysis  required  by  modern 
electronic  systems.  The  other  board  level  studies  were  performed 
before  most  of  today's  CAD  tools  were  even  in  existence.  Also  the 
growing  number  and  added  maturity  of  CAD  tools  requires  a  new  study 
to  be  performed. 

In  this  report  we  first  establish  the  objectives  of  our 
research  and  provide  support  for  the  research  in  the  form  of  an 
overview  of  the  design  for  testability  philosophy.  We  then  state 
and  support  the  criteria  and  method  by  which  we  evaluate  each 
testability  tool.  A  section  is  devoted  to  each  tool  that  briefly 
describes  how  each  software  meets  the  established  criteria.  These 
descriptions  should  not  be  considered  complete  overviews  of  the 
tools.  For  details  beyond  the  scope  of  this  report,  any  interested 


individuals  should  contact  the  individual  vendors.  We  conclude  with 
a  summary  of  the  research  and  our  recommendations. 

II.  OBJECTIVES  OF  THE  RESEARCH  EFFORT 


The  objective  of  our  research  was  to  analyze  and  compare 
several  testability  analysis  tools  as  to  how  they  perform  a  board 
level  testability  analysis  of  electronic  systems.  This  involved 
evaluating  several  aspects  of  each  tool  from  ease  of  input  to 
clarity  and  accuracy  of  the  outputs.  Another  main  goal  of  this 
research  was  to  determine  one  or  more  Figures  of  Merit (FOM)  that 
can  be  used  to  set  testability  requirements.  Since  there  is  no 
standard  method  for  obtaining  an  FOM,  the  resulting  FOM  from  each 
tool  is  unique.  The  proper  FOM  would  be  one  which  describes  the 
testability  of  a  CCA  in  the  fewest  possible  parameters  with  the 
most  insight  into  its  testability.  To  get  an  accurate  evaluation, 
the  CAD  tools  were  tested  with  a  variety  of  circuits,  namely 
digital,  analog,  and  hybrid  circuits.  For  our  study  to  be 
realistic,  it  was  carried  out  using  actual  CCAs  selected  from  the 
E-3  AWACS  aircraft. 


III.  DESIGN  ^OR  TESTABILITY 

Testability  addresses  the  issues  involved  in  achieving  the 
inherent  ability  in  a  system  to  be  testable  so  that  desired  levels 
of  fault  detection  and  isolation  can  be  achieved  in  an  accurate, 
timely,  and  cost-effective  manner.  The  testing  of  an  item  can  be 
influenced  by  the  physical  design  of  the  item,  the  electrical 
design  of  the  item,  and  procedures  and  equipment  used  to  test  the 
item.  With  the  introduction  of  sophisticated  test  equipment, 
lacking  physical  designs  can  be  overcome  with  guided  probe  and 
bed-of-nails  techniques.  However,  this  becomes  ineffective  in  test 
cost  and  time  required  to  test.  Even  with  the  sophistication  of  the 
test  equipment,  the  system  must  still  be  designed  in  such  a  manner 
that  it  is  testable.  This  is  known  as  being  inherently  testable.  As 
systems  become  more  costly  and  more  complex  with  higher  part 
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density  on  each  CCA,  the  ability  to  test  and  maintain  these  systems 
becomes  increasingly  difficult.  Only  if  a  system  is  highly  testable 
can  confidence  be  placed  in  the  fact  that  it  is  fault  free.  Once  a 
system  is  known  to  contain  a  fault,  the  ability  to  isolate  the 
fault  is  directly  related  to  the  inherent  testability  of  the 
system.  The  testability  of  the  system  is  one  of  the  leading  factors 
influencing  the  LCC  of  the  system.  This  is  because  the  maintenance 
of  a  system  is  influenced  by  its  testability  and  it  can  account 
for  50%  of  its  LCC. 

The  most  common  method  of  system  design  places  the  test 
engineer  at  the  end  of  the  design  phase.  Typically,  at  this  point, 
the  hardware  has  been  designed  and  is  at  some  point  of  production 
and  the  test  engineer  must  find  a  way  to  improve  the  testability  of 
the  system  through  probe  tests  and  other  intrusive  methods.  This 
method  is  time  consuming  and  quite  often  the  test  requirements 
cannot  be  met  due  to  a  lack  of  inherent  testability  of  the  design. 
The  decision  must  then  be  made  whether  to  redesign  the  hardware  or 
relax  the  test  requirements.  Since  the  hardware  is  being  produced 
in  some  fashion  at  this  point,  the  redesign  of  the  hardware  would 
be  very  costly  and  time  consuming.  On  the  other  hand,  the 
relaxation  of  the  test  requirements  would  reduce  the  testability  of 
the  system  and  thus  increase  its  maintenance  cost  and  hence  its 
LCC.  The  most  logical  solution  to  this  is  to  include  testability  in 
the  initial  design  phase  so  that  test  requirements  can  be  met 
before  the  system  enters  the  production  phase.  The  test 
rec[uirements  can  then  be  treated  as  additional  design  requirements 
for  the  system.  This  is  called  design  for  testability  (DFT) . 
Addressing  testability  issues  in  the  design  phase  enhances  the 
testability  of  a  system  thus  reducing  its  LCC. 

As  the  complexity  of  the  system  increases,  the  ability  of  a 
designer  to  incorporate  testability  by  the  old  time-consuming 
manual  methods  becomes  impossible.  Thus  MIL-STD-2165  becomes  more 
of  a  guideline  than  a  method  of  analyzing  testability.  At  the  early 
stages  of  the  design  phase,  the  designs  can  change  on  a  daily 
basis.  Therefore,  some  method  of  incorporating  these  changes 
rapidly  into  the  testability  analysis  becomes  a  necessity.  This 
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problem  lends  itself  quite  well  to  computer-aided  design (CAD) 
systems.  Therefore  CAD  tools  have  become  necessary  for  proper 
testability  analysis.  There  are  several  such  tools  available.  Some 
of  the  popular  ones  are:  CAFIT,  I-CAT,  STAMP,  ASTEP,  and  STAT.  Our 
main  concern  was  to  compare  these  tools  for  the  USAF.  ASTEP  was 
considered  in  the  original  proposal,  but  was  removed  from  this 
study  by  request  because  it  is  primarily  a  system  level  testability 
analysis  tool  and  it  is  not  usable  for  testability  analysis  at  the 
CCA  level. 


IV.  CRITERIA  FOR  TESTABILITY  ANALYSIS  TOOLS 

The  first  criteria  to  evaluate  was  the  input  requirements  of 
each  testability  tool  as  to  how  much  effort  is  required  to  prepare 
the  design  information  in  a  form  suitable  for  the  software.  This 
involved  evaluating  the  modeling  method  used  for  each  program. 
Other  aspects  such  as  the  flexibility  of  the  modeling  procedure  and 
the  time  required  to  do  the  modeling  should  also  be  included  in  the 
criteria.  The  next  criteria  evaluated  was  the  ease  with  which  the 
modeling  information  can  be  entered  into  the  program.  This  included 
the  user  interface  as  well  as  the  form  in  which  the  information  is 
presented  back  to  the  user.  The  data  should  be  presented  in  such 
a  way  as  to  allow  the  user  to  debug  the  information  quickly.  The 
error  checking  ability  and  the  clarity  of  the  error  messages 
presented  to  the  user  are  two  criteria  that  must  be  addressed  for 
tasks  of  the  size  being  considered  (i.e.  large  CCAs  such  as  the 
ones  we  used) .  The  run-time  of  the  programs  is  also  very  important 
since  one  would  not  wish  to  wait  overnight  for  the  analysir.  of  a 
single  CCA. 

The  primary  criteria  for  the  evaluation  of  the  testability 
analysis  tools  is  the  information  in  the  form  of  a  FOM  from  the 
output  of  each  tool.  The  output  should  be  concise  for  the  quick 
assessment  of  the  inherent  testability  of  the  CCA.  It  should  also 
be  detailed  enough  to  provide  insight  into  the  details  of  the 
testability  in  a  specific  portion  of  the  CCA.  This  information 
should  include  such  aspects  as  the  various  ambiguity  groups. 
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feedback  loops,  types  of  testing  strategy  to  apply,  etc.  In 
addition  to  the  quantitative  outputs,  some  qualitative  outputs 
such  as  suggestions  to  improve  the  testability  are  also  desirable. 
These  include  where  to  break  feedback  loops,  where  to  add  test 
points,  test  points  to  eliminate,  and  any  other  suggestions. 

In  addition  to  these  criteria,  some  other  aspects  of  the 
individual  programs  were  also  evaluated.  These  were  primarily 
features  regarding  the  testability  analysis  unique  to  a  particular 
program. 

Finally,  to  summarize  the  evaluation  criteria,  they  are  in  the 
following  order  of  importance: 

(1)  A  concise  FOM  yielding  a  measure  of  testability 

(2)  Suggestions  to  improve  testability  through  the  breaking 
feedback  loops 

(3)  Suggestions  to  improve  testability  through  the 
addition  or  elimination  of  test  nodes 

(4)  Information  on  ambiguity  groups  and  feedback  loops 

(5)  Testing  strategy  or  procedure 

(6)  Data  input  requirements 

(7)  Amount  of  modeling  required 

(8)  User  interface, 

V.  METHOD  OF  EVALUATION 


Our  primary  concern  in  the  evaluation  of  the  testability  tools 
was  how  the  tools  meet  the  criteria  established  in  the  previous 
section.  The  three  circuits  chosen  were  selected  to  give  a  feel  for 
how  the  tools  met  the  criteria  on  a  variety  of  real-world 
electronic  circuits.  Another  criterion  that  was  used  in  selecting 
the  circuits  was  the  experience  of  the  WRALC  personnel  in 
developing  the  depot  for  these  circuits.  The  actual  numerical 
results  are  of  little  consequence  since  each  circuit,  that  any  user 
will  analyze,  will  have  its  own  unique  characteristics.  How  the 
tools  met  the  challenge  of  evaluating  different  types  of  circuits 
was  of  a  greater  importance.  Since  every  circuit  will  yield  unique 
results,  the  numerical  results  themselves  cannot  be  used  to 
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evaluate  the  tools.  Each  tool  was  scored  on  a  scale  of  0  to  10  as 
to  how  effectively  each  of  the  criteria  was  met.  Each  criteria  was 
assigned  a  weighting  factor  based  on  its  importance  to  design  for 
testability  (DFT) . 

The  first  criteria  is  the  most  important  since  it 
characterizes  and  summarizes  the  testability  of  the  CCA  and  it 
received  a  weight  of  10.  The  second  and  third  criteria  were  both 
given  weights  of  9  since  they  are  the  primary  means  of  improving 
testability  through  design  changes.  The  fourth  criteria  was 
assigned  a  weight  of  8.  This  is  because  the  detailed  information 
can  be  used  to  improve  testability  even  though  no  direct 
suggestions  are  made.  The  fifth  criteria  is  more  limited  in  the 
information  that  it  can  provide  with  regard  to  the  design  procedure 
so  it  received  a  weight  of  6.  The  last  three  criteria  do  not  deal 
directly  with  the  testability  analysis  and  the  design  of  a  CCA. 
However,  they  are  still  important  because  even  an  effective  design 
tool  may  not  be  practical  in  its  use.  Thus,  each  of  the  last  three 
criteria  was  given  a  weight  of  5. 

Once  all  the  scores  were  assigned,  each  tool  had  its  weighted 
total  score  calculated  from  the  assigned  scores  and  weights.  Any 
evaluation  is  bound  to  be  subjective,  but  we  believe  that  our 
independence  from  any  government  agency  or  private  corporation 
limits  the  subjectiveness  as  much  as  possible. 

VI .  CAFIT 


CAFIT  stands  for  Computer-Aided  Fault  Isolation  and 
Testability  Model.  CAFIT  was  developed  by  ATAC  of  Mountain  View, 
CA.  It  is  a  government  owned  testability  analysis  tool.  CAFIT  was 
designed  to  enhance  the  testability  of  electronic  circuits,  both 
digital  and  analog.  It  provides  outputs,  after  performing  several 
forms  of  analysis,  that  aid  in  designing  circuits  to  be  inherently 
testable.  For  this  study,  CAFIT  was  run  on  an  IBM-PC  AT  compatible. 
It  requires  an  EGA  compatible  monitor  and  occupies  approximately  2 
Mbytes  of  hard  disk  space.  An  additional  1.4  Mbytes  of  disk  space 
is  taken  up  by  the  library,  but  this  will  vary  depending  on  the 
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number  of  device  models  used.  It  can  also  be  run  on  Mentor-Graphics 
workstations. 

CAFIT  provides  a  library  utility  that  builds  and  maintains  a 
model  library  of  parts.  This  library  contains  all  of  the  logic, 
attribute,  failure  rate,  and  pin  information  for  each  device  in  the 
library.  The  user  can  modify  and  add  devices  easily  to  the  library 
with  the  help  of  the  reference  manual.  Digital  devices  can  be 
entered  easily  from  the  data  sheet  of  most  devices  with  slight 
modifications  into  the  format  required  by  CAFIT.  For  the  digital 
devices,  the  model  contains  the  actual  function  of  the  device  in 
terms  of  HIGH,  LOW,  TRI -STATE,  etc.  For  analog  devices,  no 
functional  information  is  used.  Thus  the  analog  model  is  purely 
topological.  This  mixture  also  allows  for  the  modeling  of  hybrid 
components  such  as  D/A  and  A/D  converters.  Care  must  be  taken  when 
using  analog  devices  to  ensure  that  the  proper  dependency  model  is 
used. 

CAFIT  is  designed  to  accept  schematic  information  from  a 
net list  provided  by  a  CAE  package  such  as  the  CT-2000  software 
package  from  Case  Technologies,  Inc.  If  such  a  package  is  not 
available,  the  connectivity  information  from  the  schematic  can  be 
entered  into  CAFIT  through  an  ASCII  file  in  the  proper  format.  We 
used  this  method  since  the  CAE  tool  was  not  available  for  our  use. 
CAFIT  takes  this  information  along  with  the  dependency  model 
information  previously  entered  to  do  the  testability  analysis.  If 
the  CAE  system  is  available,  results  are  shown  in  color  format  on 
the  schematic.  Otherwise  the  output  files  provide  the  means  to  view 
the  results.  Input  for  the  ASCII  file  is  easily  obtained  directly 
from  the  schematic  and  this  does  not  require  any  special  knowledge 
of  the  inner  workings  of  the  circuit  if  the  circuit  is  purely 
digital.  However,  if  analog  components  are  used,  then  the  knowledge 
of  the  operation  of  the  circuit  is  essential  to  get  the  proper 
topological  models.  The  only  constraint  on  the  labeling  of  nodes  is 
in  the  labeling  of  test  nodes,  input  signals,  and  output  signals. 
Test  node  labels  must  have  a  TS  prefix  while  signal  input  and 
output  signals  must  have  a  SG  prefix.  Overall,  CAFIT  requires  very 
little  preparation  of  the  input  information  if  the  models  have 
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already  been  created  within  CAFIT.  This  will  most  often  be  the  case 
once  a  library  of  devices  has  been  established. 

Performing  a  basic  testability  analysis  with  CAFIT  for  an 
existing  design  is  relatively  easy.  A  basic  analysis  attempts  to 
locate  structures  inherent  in  the  design  that  will  inhibit  the 
testing  of  the  design.  These  structures  include  negative 
reconvergence,  logic  redundancy  and,  most  importantly,  the 
information  on  feedback  loops  present  in  the  design.  Negative 
reconvergence  is  a  design  condition  in  digital  systems  where  a 
portion  of  logic  prevents  an  input  from  controlling  an  output.  This 
condition  is  not  necessarily  a  design  flaw,  but  it  can  seriously 
impede  the  testing  of  the  circuit.  Logic  redundancies,  which 
inhibit  the  testing  of  the  circuit,  are  also  identified.  With  logic 
redundancy,  the  observability  of  the  circuit  is  reduced.  The 
feedback  loop  portion  of  the  report  is  probably  the  most  useful. 
This  provides  a  detailed  output  listing  of  all  the  components 
involved  in  each  loop  as  well  as  nodes  at  which  to  break  each  loop. 
The  next  type  of  analysis  provided  by  CAFIT  allows  the  user  to 
select  parameters  about  the  test  nodes  and  perform  a  detailed 
testability  analysis  with  regard  to  signal  controllability  and 
observability. 

If  the  test  nodes  have  already  been  selected,  an  analysis  can 
be  done  to  determine  signal  coverage  with  only  these  nodes.  The 
signal  coverage  of  a  device  is  a  measure  of  whether  or  not  stuck-at 
faults  in  device  can  be  tested  for.  This  is  a  function  of 
controllability,  observability,  and  the  number  of  tests  permitted. 
Controllability  and  observability  figures  using  the  signal  input 
and  output  and  test  sites  are  given  along  with  the  signal  coverage 
for  each  device.  In  addition  to  the  selected  test  nodes,  an 
additional  set  of  nodes  can  be  chosen  by  the  user.  CAFIT  then 
selects  the  nodes  that  will  yield  the  highest  signal  coverage.  If 
100%  coverage  can  be  achieved  with  fewer  test  nodes,  then  the 
lowest  number  of  nodes  are  chosen.  The  test  node  selection  is 
probably  one  of  the  most  powerful  aspects  of  the  analysis  that 
CAFIT  provides.  This  allows  the  user  to  experiment  with  various 
configurations  to  determine  the  optimal  number  of  test  nodes  for  a 
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given  CCA.  The  number  of  test  signals  is  an  additional  parameter 
that  can  also  be  varied.  This  constraint  is  becoming  less  prevalent 
as  sophisticated  test  equipment  becomes  commonplace.  This  is  also 
more  applicable  to  digital  circuits  where  CAFIT  has  the  knowledge 
of  the  various  states  of  the  circuit  from  the  model  description. 
Analog  circuits  may  require  only  one  or  possibly  several  tests  to 
be  performed  on  a  single  node,  depending  on  the  circuit. 

The  outputs  from  CAFIT  can  be  used  for  the  selection  of  test 
nodes,  the  selection  of  points  to  break  feedback  loops,  and  the 
estimation  of  controllability  and  observability.  However,  most 
testability  requirements  are  in  terms  of  fault  isolation  parameters 
and  ambiguity  group  sizes.  Even  though  a  node  is  controllable  and 
observable,  it  may  not  have  a  unique  test  that  identifies  only  that 
node,  leading  to  the  creation  of  ambiguity  groups.  This  is 
especially  true  when  dealing  with  feedback  loops.  The  entire  loop 
is  an  ambiguity  group  even  though  each  device  may  by  controllabxe 
and  observable.  This  is  especially  important  in  the  testing  of 
analog  and  hybrid  circuits  where  feedback  networks  are  invariably 
used.  The  information  provided  through  controllability  and 
observability  is  not  enough  to  describe  the  testability  of  an 
analog  or  a  hybrid  circuit  thoroughly. 

The  user  interface  for  CAFIT  is  one  of  the  easiest  to  use  and 
understand.  It  is  designed  in  such  a  way  that  even  an  inexperienced 
user  can  use  the  program.  On  screen  help  may  be  called  up 
throughout  the  program.  CAFIT  provides  information  on  the  screen 
without  cluttering  it  up.  The  error  messages  provided  were 
descriptive  and  aided  in  the  debugging  of  the  various  circuits 
tested.  The  only  shortfall  in  the  user  interface  is  having  to 
prepare  an  ASCII  file  if  the  CAE  system  is  not  available.  It  is 
acknowledged  that  CAFIT  was  designed  to  be  used  with  an 
accompanying  CAE  system,  but  a  provision  for  entering  the  schematic 
information  directly  into  CAFIT  and  having  CAFIT  manage  the  files 
would  have  been  helpful.  CAFIT  also  writes  the  output  data  to  one 
file  only  without  the  option  to  change  the  name  of  the  file.  Thus, 
after  each  run,  it  overwrites  the  existing  file  making  it  necessary 
to  continually  rename  the  output  file  after  each  run.  This  is  not 
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a  serious  drawback,  but  it  can  be  frustrating  after  waiting  45 
minutes  for  results  and  then  realizing  previous  results  were 
accidentally  overwritten. 

CAFIT  is  primarily  geared  toward  the  analysis  of  digital 
circuits.  It  can  analyze  analog  and  hybrid  circuits  also.  However, 
the  terminology  used  by  the  program  and  the  user's  manual  suggest 
that  the  analog  portion  has  been  added  as  an  afterthought.  The  lack 
of  fault  isolation  and  ambiguity  group  parameters  limits  the 
usefulness  of  the  output  to  verify  whether  the  desired  levels  of 
testability  are  being  met.  However,  the  test  node  selection  and 
feedback  loop  breakpoint  analysis  make  CAFIT  a  tool  for  choosing 
test  nodes  and  making  changes  in  the  circuit  co  increase  its 
inherent  testability. 


CRITERIA  and  SCORES  for  CAFIT: 

(1)  5  :  The  controllability  and  observability  figures  alone  are 

not  enough  to  fully  characterize  the  testability  of  a 
circuit. 

(2)  8  :  The  suggestions  for  selecting  feedback  loop  breakpoints 

are  essential. 

(3)  8  ;  The  time  spent  selecting  appropriate  test  nodes  is 

greatly  reducec  with  CAFIT  suggesting  the  most  effective 
test  nodes. 

(4)  3  :  The  information  on  feedback  loops  is  helpful,  but  there 

is  a  total  lack  of  information  on  ambiguity  groups. 

(5)  0  :  No  outputs  concerning  testing  procedure. 

(6)  5  :  The  existence  of  the  interface  with  the  CAE  tool  can  make 

data  input  simple.  However,  the  alternative  of  having  to 
format  an  ASCII  file  with  all  of  the  circuit  information 
is  tedious. 

(7)  6  :  Minimal  modeling  is  required  once  a  library  of  devices  is 

established.  However,  modeling  of  analog  circuits  must 
still  be  done  on  an  individual  basis. 

(8)  8  :  The  user  interface  is  simple  and  easy  to  use  and  has 

plenty  of  on-line  help. 
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VII.  STAT 


STAT  is  a  product  of  Detex  Systems,  Inc.  of  Orange,  CA.  STAT, 
along  with  its  predecessor  LOGMOD,  is  a  testability  analysis  tool 
designed  to  enhance  the  testability  of  systems.  This  applies  to  the 
enhancement  of  the  testability  of  new  systems  through  their  design 
and  for  the  maintenance  of  existing  systems.  Our  study  will  focus 
on  the  design  aspects  of  its  capability.  STAT  runs  on  a  variety  of 
PC  platforms,  but  a  minimum  of  a  286-based  PC  is  recommended  with 
a  sufficiently  large  hard  drive  to  store  the  STAT  program  and  all 
of  the  user's  databases.  It  requires  approximately  2.5  Mbytes  of 
disk  space  for  the  program  itself.  The  basis  for  the  analysis  that 
STAT  performs  is  the  dependency  m.ode]  nformation  supplied  by  the 
user. 

A  dependency  model  is  one  that  simply  shows  the  relationships 
(dependencies)  between  the  various  nodes  and  items  in  a  system. 
This  can  be  interpreted  as  describing  the  information  flow  within 
a  general  system  and  as  a  signal  flow  for  an  electronic  system.  The 
topological  information  from  a  circuit  is  entered  into  STAT 
completely  in  the  form  of  dependency  models,  not  as  the  connections 
between  the  actual  electronic  components.  This  type  of  modeling  has 
advantages  as  well  as  disadvantages.  The  main  advantage  for 
electronic  circuits  is  that  it  bridges  the  gap  between  the 
descriptions  of  digital  and  analog  circuits.  Another  advantage  of 
the  dependency  modeling  for  electronic  circuits  is  that  it  provides 
considerable  flexibility  to  describe  a  particular  circuit.  This 
includes  what  aspects  of  a  particular  signal  are  important  as  well 
as  the  level  of  modeling  for  each  portion  of  the  circuit.  For 
example,  it  allows  the  user  to  choose  whether  or  not  to  consider 
packages  containing  more  than  one  device  to  be  considered  as 
indiv  duals  or  as  a  single  item.  Being  able  to  describe  the 
particular  aspects  of  one  signal  as  individual  signals  is 
particularly  in  analog  circuits  where  biasing  and  actual  signal 
information  must  be  considered  separately.  The  disadvantage  of  this 
type  of  modeling  is  that  it  requires  an  intimate  knowledge  about 
the  operation  of  a  particular  circuit  in  order  for  it  to  be 
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accurately  modeled.  Modeling  a  complex  analog  or  hybrid  circuit  can 
be  a  time  consuming  and  frustrating  task  even  when  the  circuit  is 
familiar  to  the  user.  However,  the  proper  modeling  up  front  will 
help  to  reduce  future  problems  and  the  time  required  to  perform  a 
proper  testability  analysis  using  STAT. 

The  entering  of  the  dependency  model  information  into  STAT  is 
very  straight-forward.  The  information  is  entered  in  an  interactive 
fashion  with  STAT  updating  and  maintaining  the  databases 
transparently.  Each  node  in  the  dependency  model  is  entered  as  a 
test  node.  One  aspect  of  STAT,  that  is  particularly  frustrating,  is 
that  all  node  labels  must  have  a  'T'  prefix  and  all  item  labels 
must  have  an  'I'  prefix.  This  requires  the  user  to  maintain  some 
type  of  translation  table  independently.  Being  able  to  use 
descriptive  labels  would  aid  in  interpreting  the  results  of  the 
testability  analysis.  Descriptions  can  be  added  to  the  dependency 
information  as  it  is  entered.  The  dependency  model  information  is 
displayed  back  to  the  user  in  a  manner  that  allows  errors  to  be 
seen  visibly  and  corrected.  Once  the  basic  dependency  information 
has  been  entered,  the  model  can  then  be  processed  to  find  and 
correct  any  additional  errors.  The  error  messages  supplied  are 
descriptive  and  they  can  be  corrected  easily. 

The  evaluation  of  a  model  with  STAT  is  not  a  one  step  process, 
but  rather  a  series  of  processes.  When  a  model  is  first  processed, 
all  nodes  are  considered  as  test  nodes.  This  becomes  the  base  model 
for  all  subsequent  evaluations.  This  is  the  point  at  which  STAT 
allows  the  user  to  perform  a  variety  of  "What  if?"  type  scenarios. 
This  is  done  by  first  creating  a  new  model  that  is  identical  to  the 
base  model  with  a  simple  keystroke.  The  user  can  then  modify  the 
characteristics  of  each  node  within  the  model.  Most  nodes  will  be 
chosen  as  non-test  nodes.  However,  the  aspects  of  the  chosen  test 
nodes  can  also  be  further  modified.  The  type  of  access  required  to 
reach  a  node,  such  as  external  or  probe  accessible,  can  be 
specified.  The  cost  and  time  to  test  each  point  can  also  be 
specified.  The  failure  rate  and  replacement  time  and  cost  of 
individual  items  can  also  be  modified.  Other  factors  can  also  be 
modified  and  weighted.  The  weighting  allows  the  user  to  perform  an 


analysis  based  on  individual  specifications. 

One  aspect  of  the  operation  of  STAT  deserves  special  mention. 
Along  with  identifying  feedback  loops  within  the  model,  STAT  has  an 
interactive  feature  called  the  Feedback  Loop  Breaker.  This  feature 
identifies  feedback  loops,  then  recommends  the  no ^es  at  which  to 
break  the  feedback  loops  and  the  new  characteristics  resulting  from 
the  breakage.  The  resulting  ambiguity  groups  can  be  viewed 
immediately.  This  allows  the  user  to  continue  to  modify  the  loops 
until  the  required  ambiguity  group  requirements  are  met.  This  is 
just  one  aspect  of  a  very  workable  user  interface.  Once  all  the 
terminology  associated  with  STAT's  modeling  is  understood,  it  is 
easy  to  maneuver  through  many  screens  and  perform  a  testability 
analysis . 

The  outputs  from  STAT  take  on  a  variety  of  forms  that  range 
from  the  concise  and  informative  to  the  long  and  drawn  out.  The 
longer  outputs  are  more  applicable  to  the  maintenance  aspects  of 
testability  and  not  to  the  design  process.  Each  output  repc  t 
contains  a  summary  section  and  a  detailed  section.  This  is  very 
helpful  when  trying  to  locate  a  particular  piece  of  information 
without  turning  through  perhaps  hundreds  of  pages.  The  management 
model  report  contains  a  graphical  representation  of  the  topological 
dependencies  for  a  particular  model.  This  allows  the  user  to  more 
clearly  see  the  loops  that  are  not  easily  identifiable  in  the 
original  circuit  diagram.  The  feedback  loop  indicator  report 
contains  a  detailed  information  on  all  feedback  loops  in  a  model. 
This  includes  information  on  the  relative  complexity  of  the 
feedback  loops  and  information  on  the  sizes  of  the  ambiguity  groups 
resulting  from  the  breaking  of  the  loops.  The  topological  indicator 
report  contains  the  most  information  to  determine  the  inherent 
testability  of  a  circuit.  The  fault  isolation  level  is  given  for 
each  ambiguity  group  size  present  in  the  circuit.  This  FOM  is  a 
primary  means  of  verifying  whether  testability  requirements  are 
being  met.  All  the  characteristics  of  each  ambiguity  group,  test, 
and  item  are  given  in  the  detail  section.  This  information  is  in  an 
easy  to  read  form,  but  for  large  circuits,  the  detailed  section 
can  become  quite  large.  The  last  report  produced  is  the  fault 
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isolation  indicator  report.  The  processing  of  this  report  provides 
a  means  for  determining  the  number  and  types  of  tests  to  use.  The 
user  has  options  regarding  the  acceptable  size,  time  to  replace, 
and  cost  to  replace  for  ambiguity  groups.  The  user  can  also  select 
the  number  and  type  of  tests  to  consider.  The  report  then  lists  the 
suggested  tests  to  use  and  tests  that  are  not  used.  This  report 
also  includes  a  detailed  flow  diagram  for  the  isolation  of  faults. 
This  diagram  includes  the  cost  and  time  spent  testing  to  reach  a 
particular  point  in  the  isolation  procedure.  The  first  three 
reports,  namely  the  management  model,  feedback  loop  indicator,  and 
topological  reports,  are  better  suited  to  evaluate  the  testability 
of  a  particular  design. 

Using  the  dependency  modeling,  STAT  can  analyze  any  circuit 
regardless  of  whether  it  is  analog,  digital,  or  hybrid.  The  only 
difference  is  in  the  amount  of  modeling  require'^  for  a  particular 
CCA.  The  multitude  of  output  information  allows  the  evaluation  of 
just  about  any  aspect  of  the  testability.  The  most  important  and 
useful  of  which  are  the  detailed  fault  isolation  and  ambiguity 
group  parameters. 

CRITERIA  and  SCORES  for  STAT: 

(1)  9  :  The  presentation  of  the  fault  isolation  levels  in  a 

simple  table  is  the  ideal  result  to  express  the 

testability  of  a  circuit. 

(2)  10  :  The  feedback  loop  breaker  provides  information  directly 

on  the  results  of  breaking  loops.  The  interactive  nature 
of  this  feature  makes  dealing  with  feedback  loops 
extremely  easy. 

(3)  9  :  Provides  informative,  detailed  suggestions  regarding  the 

placement  of  test  nodes. 

(4)  9  :  STAT  provides  very  detailed  information  on  all  aspects  of 

ambiguity  groups  and  feedback  loops. 

(5)  9  :  Very  detailed  descriptions  of  test  strategy  and 

suggestions  as  to  what  type  of  test  to  use. 

(6)  6  :  The  interactive  data  editor  makes  data  entry  easy,  but 
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having  to  have  the  proper  prefixes  for  all  nodes  and 
items  is  irritating. 

(7)  4  :  The  development  of  the  dependency  model  for  complex 

circuits  can  become  very  tedious  and  time  consuming. 

(8)  8  :  The  descriptive  n  es  of  all  the  choices  presented  and 

the  on-line  help  r  xe  STAT  an  easy  program  to  operate. 

VIII.  STAMP 

STAMP,  Which  stands  for  System  Testability  and  Maintenance 
Program,  is  a  tool  designed  for  the  analysis  of  system  testability 
and  fault-isolation  strategies.  STAMP  is  a  program  from  ARINC 
Research  Corporation  of  Annapolis,  MD.  It  runs  on  an  HP-1000 
mini-computer  and  recently  a  PC-based  version  has  been  announced. 
STAMP  is  for  the  use  of  ARINC  employees  only  and  primarily  for 
government  contracts.  It  is  unfortunate  that,  at  this  time,  it  is 
not  available  to  the  end  user  directly.  For  a  tool  to  be  effective 
in  the  design  phase,  it  must  be  readily  available  to  the  design 
personnel.  Then,  when  design  changes  in  the  circuit  occur,  their 
effect  on  the  circuit's  inherent  testability  can  be  quickly 
determined  and  suggestions  made  back  to  the  design  personnel  on  how 
to  improve  the  circuit's  testability.  We  developed  the  models  for 
the  three  circuits  and  sent  them  to  ARINC  to  be  processed.  The 
quick  turn-around  for  receiving  the  results  would  be  sufficient 
when  doing  an  analysis  of  a  previously  designed  circuit  for  testing 
purposes  only.  However,  for  a  design  work,  the  results  must  be 
available  as  quickly  as  possible.  STAMP  performs  its  analysis  based 
on  the  first-order  dependency  model. 

The  discussion  on  the  dependency  model  in  the  previous  section 
applies  to  STAMP  as  well  except  for  the  differences  in  terminology 
for  tests  and  items.  STAMP  refers  to  these  as  events  and  elements, 
respectively.  The  use  of  the  dependency  model  is  essential  for 
modeling  hybrid  and  analog  circuits  properly.  Again  the  main 
disadvantage  of  dependency  modeling  is  the  large  amount  of  time 
required  to  model  analog  and  hybrid  circuits.  The  user  has  several 
options  for  inputting  the  dependency  information.  The  dependency 


information  can  be  entered  directly  into  STAMP  through  an 
interactive  interface,  through  an  ASCII  file,  or  through  the 
wire-list  output  of  the  popular  schematic  drawing  program  Schema. 
We  produced  our  dependency  lists  which  were  then  entered  directly 
into  STAMP  manually.  STAMP  has  default  labels  for  all  the  aspects 
of  the  dependency  model.  However,  these  labels  may  be  modified  so 
that  the  labels  on  the  output  information  can  be  related  directly 
back  to  the  original  schematic.  In  addition  to  the  first-order 
dependency  information,  a  wide  range  of  modifiable  parameters  are 
also  available.  These  include  failure  rates,  different  types  of 
test,  test  time  and  cost,  test  grouping,  component  grouping,  etc. 
The  ease  with  which  the  data  can  be  entered  is  not  a  criteria  that 
we  can  judge  fairly  since  we  did  not  actually  run  STAMP.  Likewise, 
the  options  available  when  obtaining  output  cannot  be  fairly 
judged.  However,  the  extensive  outputs  can  be  analyzed  and  they  do 
contain  the  information  on  testability  parameters. 

The  outputs  are  separated  into  two  groups,  namely,  the 
testability  analysis  and  the  fault-isolation  analysis.  In  addition, 
the  lists  of  all  components,  dependencies,  and  a  wide  variety  of 
other  lists  are  also  available  from  the  output.  The  amount  of  paper 
consumed  can  become  excessive,  but  the  reports  can  be  printed 
separately.  Our  primary  interest  is  in  the  testability  analysis 
section  which  provides  24  different  measures  of  testability.  All 
the  results  are  normalized  to  provide  the  user  with  an  idea  of  how 
the  measures  compare  relatively.  Most  of  the  measures  are  slightly 
esoteric,  but  several  provide  very  information  on  what  to 
concentrate  and  on  how  to  improve  the  inherent  testability.  The 
outputs,  that  are  the  most  informative,  are  the  operational 
isolation  measures.  These  are  commonly  referred  to  as 
fault- isolation  levels.  These  are  one  of  the  means  of  stating  the 
required  testability  specifications  in  the  SOW  of  government 
contracts.  They  are  the  primary  FOM  for  inherent  testability.  The 
two  primary  results  give  the  fault-isolation  for  a  given  ambiguity 
group  size.  One  output  considers  the  weighting  of  the  failure  rates 
while  the  other  does  not.  This  allows  for  additional  flexibility 
for  the  specifications  to  achieve  desired  testability  levels.  STAMP 
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provides  information  on  ambiguity  groups  such  as  number,  size  and 
component  members.  One  type  of  output,  that  is  worth  noting,  is  the 
listing  of  excess  tests  and  redundant  tests.  This  information  is 
very  useful  when  the  number  of  test  nodes  available  is  limited  by 
some  other  factor  such  as  connector  pins.  Components  and  tests  that 
comprise  feedback  loops  are  also  listed.  The  user's  manual  states 
that,  in  addition  to  listing  the  loops,  additional  tests  are 
identified  that  could  be  in  dealing  with  the  loops.  We  couxd  not 
find  these  suggestions  in  any  of  the  outputs  that  were  supplied  to 
us.  These  are  the  most  useful  of  the  testability  analysis  outputs. 
Additional  outputs  are  available  that  describe  a  variety  of  other 
conditions  and  parameters. 

The  fault-isolation  analysis  provides  the  information  that  a 
technician  would  use  in  the  isolation  of  a  fault.  Lists  are  given 
which  identify  the  tests  that  will  be  bad  given  a  failure  in  a 
particular  component.  Probabilities  of  failure  figures  are  also 
given  for  all  components  and  tests.  A  tabular  decision  tree  and  a 
graphical  tree  can  be  produced  that  detail  the  steps  a  technician 
would  follow  in  the  fault  isolation  procedure.  The  procedure  can  be 
affected  by  different  weightings  and  options  associated  with  test 
time,  test  cost,  test  groupings,  component  groupings,  etc.  Finally, 
a  summary  of  the  testing  procedure  in  terms  of  the  number  of  steps 
required  to  isolate  faults  is  given. 

With  the  use  of  dependency  modeling,  STAMP  can  also  analyze 
any  type  of  circuit  regardless  of  whether  it  is  digital,  analog,  or 
hybrid.  The  drawback  to  this  is  the  extensive  amount  of  modeling 
that  must  be  performed  to  properly  analyze  analog  and  hybrid 
circuits.  The  major  feature  missing  from  STAMP  is  the  lack  of 
information  on  breaking  up  feedback  loops.  The  most  useful  output 
provided  is  the  data  on  fault  isolation  levels  related  to  ambiguity 
group  size.  The  extensive  output  information  for  about  every 
conceiva;  e  seems  excessive,  but  may  find  application  in  different 
situations. 

NOTE:  STAMP  will  only  be  an  effective  tool  for  incorporating 
testability  in  the  design  phase  if  it  is  available  for 


use  by  the  design  personnel.  The  time  needed  to  obtain 
the  results  and  the  need  to  experiment  with  many 
different  testing  scenarios  makes  this  a  necessity. 

CRITERIA  and  SCORES  for  STAMP: 

(1)  10  :  The  operational  isolation  measures  for  both  the 

consideration  or  exclusion  of  failure  rates  is  the  most 
result. 

(2)  0  :  No  suggestions  are  provided  for  the  breaking  of  feedback 

loops.  This  is  a  major  drawback  since  the  existence  of 
feedback  loop  is  a  primary  contributor  to  poor 
testability.  Invariably  all  real  world  CCAs  will  have 
feedback  loops. 

(3)  6  :  The  listing  of  redundant  and  excessive  tests  is  very 

useful  for  large  designs  where  all  possible  paths  to  the 
output  tests  are  not  always  clear.  However,  no 
suggestions  are  made  regarding  the  placement  of 
additional  tests. 

(4)  9  :  STAMP  provides  detailed  listings  of  every  aspect  of  the 

dependencies,  ambiguity  groupings,  etc. 

(5)  8  ;  Detailed  test  procedures  are  given  in  decision  tree  form. 

(6)  8  :  The  existence  of  a  CAD  interface  with  Schema  can  make 

data  input  easier.  The  flexibility  of  the  labeling  of 
components  and  tests  makes  data  entry  and  verification 
easier. 

(7)  4  :  The  development  of  the  dependency  model  for  complex 

circuits  can  become  very  tedious  and  time  consuming. 

(8)  N/A  ;  We  cannot  evaluate  this  aspect  of  STAMP  since  we  did  not 

actually  use  the  program. 

IX.  I-CAT 

I-CAT,  which  stands  for  Intelligent  Computer  Aided  Testing,  is 
a  model  based  expert  system  designed  for  fault-diagnosis 
applications.  I-CAT  is  a  software  program  from  Automated  Technology 
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Systems  Corporation.  We  ran  I-CAT  under  SunView  on  a  Sun 
SPARCstation  1  with  8  MBytes  RAM  installed  and  a  color  monitor. 
Although  I-CAT  is  primarily  designed  for  fault-diagnosis,  it  has 
several  features  that  make  it  attractive  for  performing  testability 
analysis.  I-CAT  also  requires  essentially  the  same  type  of  models 
as  those  required  by  STAT  and  STAMP.  The  only  difference  is  that 
I-CAT  refers  to  the  model  as  a  functional  model  rather  than  a 
dependency  model,  although  both  contain  the  same  information. 

The  model  information  is  entered  into  I-CAT  through  a 
graphical  interface.  I-CAT  defaults  to  starting  all  component 
labels  with  a  'C'.  This  can  be  altered  to  anything  that  is 
considered  appropriate  by  the  user.  When  test  nodes  are  placed  in 
the  diagram,  they  can  have  any  label  that  has  not  already  been 
used.  Since  the  model  can  be  seen,  it  is  easier  to  debug  the  input 
data  with  I-CAT  than  other  programs.  However,  this  is  not  always 
the  case  with  large,  cumbersome  models.  Even  when  using  the  grid  to 
aid  in  the  alignment  of  connecting  lines,  it  is  often  difficult  to 
make  the  connections  if  the  view  is  not  zoomed  in.  Any  open  lines 
resulting  from  these  bad  connections  have  the  same  effect  as  in  a 
real  circuit  since  the  signal  path  is  broken.  These  are  sometimes 
difficult  to  spot  unless  the  view  is  zoomed  in  quite  close.  Trying 
to  maneuver  around  a  large  model  can  be  frustrating  due  to  the  lack 
of  a  dynamic  scroll.  However,  I-CAT  encourages  the  use  of  nested 
models  to  make  the  model  easier  to  create,  edit,  and  view.  Any 
component  can  contain  a  substructure  that  can  consist  of  another 
model.  I-CAT  contains  a  built-in  syntax  checker  that  verifies  the 
graphical  model  for  connectivity  errors.  This  is  essential  since 
the  graphical  interface  can  sometimes  be  deceiving.  Along  with  the 
functional  model,  various  other  parameters  may  be  entered,  such  as 
component  cost,  component  failure  rate,  and  test  cost. 

I-CAT' s  development  tools  generate  a  wide  variety  of  outputs 
that  have  many  uses.  We  focus  primarily  on  the  testability  analysis 
data.  Several  pertinent  input  options  are  available  to  the  user 
that  have  an  effect  on  the  testability  analysis,  namely,  whether  or 
not  to  allow  multiple  faults,  whether  or  not  to  use  only  the 
assigned  test  nodes,  and  whether  or  not  to  test  inside  all  the 
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substructures.  If  the  assigned  test  nodes  are  not  used,  the  number 
of  test  nodes  to  use  may  be  entered.  The  primary  outputs  of  the 
testability  analysis  are  the  line  tree  and  analysis  report.  The 
line  tree  provides  a  graphical  representation  of  the  testing 
procedure.  This  is  also  given  in  other  forms  such  as  a  bar  tree  and 
a  flow  chart.  The  analysis  report  contains  statistical  information 
on  the  cost  of  diagnosis,  the  replacement  costs,  and  the  ambiguity 
group  sizes.  Statistics  such  as  variance  and  standard  deviation  are 
given.  The  ambiguity  group  information  contains  no  information 
about  which  components  belong  to  which  group.  This  makes  it 
difficult  to  modify  a  design  to  have  smaller  ambiguity  groups. 
Information  is  also  provided  on  feedback  loops  and  test  nodes. 
I-CAT  identifies  feedback  loops  and  the  components  that  comprise 
each  loop.  No  suggestions  are  given  about  where  to  break  the  loops. 
The  test  node  data  is  the  most  useful  of  the  testability  analysis 
outputs.  It  details  which  nodes  are  used,  not  used,  and  any 
additional  nodes  that  may  be  required.  The  last  output  is  only 
given  if  the  assigned  test  nodes  are  ignored.  It  is  not  clear  what 
testability  parameter  of  the  model  is  being  maximized  by  the  choice 
of  test  nodes.  There  are  no  outputs  that  give  the  testability  of 
the  model  in  terms  of  fault  coverage  and  ambiguity  group  sizes. 
These  two  aspects  are  the  important  ones  for  DFT  and  it  is 
unfortunate  that  these  are  not  available  from  I-CAT.  I-CAT  can  also 
produce  diagnostic  programs  in  various  languages,  test  requirement 
documents  (TRD) ,  netlist  information,  and  a  Personal  ATLAS 
Workstation  (PAWS)  data  base. 

I-CAT  also  has  several  additional  features  that  deserve 
mention,  although  not  directly  related  to  design  for  testability. 
These  are  a  troubleshooter,  fault  simulator,  and  a  rule  editor.  The 
troubleshooter  is  an  interactive  graphical  interface  for 
diagnostics.  The  user  enters  components  known  to  be  good  and  the 
results  of  tests.  The  troubleshooter  can  direct  a  technician  on  how 
to  proceed  with  the  testing  of  a  device.  This  is  all  accomplished 
through  extensive  dialog  boxes  that  are  full  of  information 
previously  entered  into  I-CAT.  The  symptoms  reported  at  test  nodes 
can  also  be  entered  into  I-CAT.  The  user  can  even  step  backward 
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through  a  diagnostic  session.  Perhaps  the  most  important  aspect  of 
the  troubleshooter  is  that  the  failure  information  can  be  saved  to 
create  a  history  of  components  that  fail.  The  fault  simulator  can 
be  used  to  compare  the  testing  strategies  used  by  technicians  and 
the  strategies  offered  by  I-CAT.  I-CAT  randomly  introduces  a  fault 
into  the  model  and  it  is  up  to  the  user  to  find  its  location.  The 
user  can  keep  track  of  the  number  of  steps  required  to  isolate  a 
fault  and  then  let  I-CAT  find  the  fault.  This  way  a  technician  can 
train  on  a  particular  circuit  without  having  to  damage  the  actual 
hardware.  The  rule  editor  is  probably  what  distinguishes  I-CAT  from 
the  other  testability  tools.  Information  on  all  aspects  of  the 
tests  and  the  testing  procedure  is  entered  through  the  rule  editor. 
The  parameters  associated  with  tests  range  from  the  dimensions  of 
the  measured  quantities  to  the  probabilities  of  test  results.  These 
quantities  along  with  the  failure  history  of  the  model  from  the 
troubleshooter  can  be  used  to  increase  the  quality  of  the  testing 
procedure.  These  features  are  primarily  concerned  with  manual 
testing  or  any  type  of  testing  where  operator  intervention  is 
necessary. 

With  functional  modeling,  I-CAT  can  also  analyze  all  types  of 
circuits  whether  they  are  analog,  hybrid,  or  digital.  Once  again 
the  drawback  to  this  is  the  extensive  amount  of  modeling  required 
for  analog  and  hybrid  circuits.  Although  I-CAT  is  primarily  a  fault 
diagnosis  tool,  the  information  on  test  node  selection  and  feedback 
loops  is  informative  .  However,  the  lack  of  fault  isolation 
parameters  and  details  about  ambiguity  groups  make  I-CAT  better 
suited  for  establishing  testing  procedures  once  the  circuits  are 
designed. 

CRITERIA  and  SCORES  for  I-CAT: 

(1)  0  :  No  final  testability  measure  is  given. 

(2)  0  :  No  information  on  where  to  break  feedback  loops. 

(3)  9  :  The  details  concerning  test  nodes  are  the  most  useful  of 

all  the  outputs  with  regard  to  design  for  testability. 

(4)  4  :  Only  limited  information  on  ambiguity  groups  and  feedback 
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loops . 

(5)  8  :  Extensive  outputs  concerning  the  testing  of  the  circuit 

including  source  code  and  documents. 

(6)  8  :  Despite  the  sometimes  picky  graphical  interface,  I-CAT  is 

flexible  with  labels  and  the  model  data  is  easy  to  enter 
and  verify. 

(7)  4  :  The  development  of  the  functional  model  for  complex 

circuits  can  be  tedious  and  time  consuming. 

(8)  9  :  The  menu  driven,  mouse  controlled  interface  was  easy  to 

use  and  very  self-explanatory. 

X.  SUMMARY 

There  are  sufficient  motivations  to  include  testability  during 
the  design  phase  of  electronic  systems.  To  include  testability 
analysis  properly,  the  appropriate  tool  must  be  chosen  to  evaluate 
and  improve  the  testability  of  the  CCAs.  The  testability  analysis 
tool  must  be  able  to  evaluate  circuits  whether  they  are  digital, 
analog,  or  hybrid. 

Each  tool  was  evaluated  for  a  digital,  analog,  and  a  hybrid 
circuit  to  assess  the  function  over  a  wide  variety  of  possible 
applications.  Each  criteria  was  scored  on  a  scale  of  0  to  10  for 
each  tool.  The  scores  assigned  to  each  tool  for  all  the  criteria 
are  summarized  below  in  Table  1  along  with  the  final  weighted 
total. 

TABLE  1:  Scores  for  testability  tools. 

CRITERIA  (WEIGHT) 

TOOL  1(10)  2(9)  3(9)  4(8)  5(6)  6(5)  7(5)  8(5)  WEIGTHED 

TOTAL 

CAFIT  5  8  8  3  0  5  6  83 13 

STAT  9  10  999  648  477 

STAMP  10  0  6  9  8  8  4  n/a  334* 

I-CAT  0  0  9  4  8  8  4  92  66 
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*  -  STAMP  was  not  available  for  our  use.  As  a  result  the  user 
interface  could  not  be  evaluated  and  the  total  score  reflects 
this  loss. 

I-CAT  is  primarily  a  fault  diagnosis  tool  and  this  is  evident 
in  its  strong  showing  in  the  criteria  associated  more  with  this 
aspect  of  testing  and  not  DFT.  Its  multitude  of  outputs  concerning 
the  testing  of  a  CCA  makes  it  ideally  suited  to  repair  depot 
development.  I-CAT  has  the  ability  to  learn  from  the  experience  of 
personnel  with  its  use  of  expert  systems  technology.  Its  ability  to 
lead  a  technician  through  the  repair  of  a  device  also  makes  it 
attractive  for  use  at  the  depot  level.  However,  in  its  present 
form,  it  is  not  suitable  tool  for  testability  analysis  during  the 
design  phase. 

CAFIT  has  the  advantage  of  being  government  owned  and  thus  it 
is  readily  available  for  use.  Its  library  facility  reduces  the  time 
required  in  modeling  of  digital  circuits  by  maintaining  the  model 
for  each  device.  However,  the  complexities  associated  with  hybrid 
and  analog  circuits  make  their  modeling  difficult  and  necessary  on 
an  individual  basis.  CAFIT  can  aid  in  the  selection  of  test  nodes, 
but  does  not  give  any  information  regarding  the  ambiguity  groups, 
testing  procedures,  or  an  FOM  dealing  with  fault  isolation  levels 
and  ambiguity  groups.  If  improvements  can  be  made  to  CAFIT  to  make 
it  more  useful  for  the  design  of  CCAs,  it  would  be  an  attractive 
testability  analysis  tool  since  it  is  already  government  owned. 

STAMP  provides  an  output  for  just  about  every  aspect  of  a  CCA. 
STAMP  does  provide  a  concise  FOM  in  terms  of  fault  isolation  levels 
and  ambiguity  groups.  However,  even  if  the  total  assigned  point 
value  was  added  for  the  user  interface,  STAMP  still  would  fall 
short  due  to  its  lack  of  suggestions  on  the  placement  of  test  nodes 
and  breaking  of  feedback  loops.  These  features  are  required  if  the 
tool  is  to  be  effective  in  the  design  process.  Furthermore,  its 
lack  of  availability  prohibits  it  from  being  an  effective  tool  for 
design  purposes. 

STAT  provides  very  extensive  outputs  that  can  aid  in  the 
design  of  testable  CCAs.  Its  numerous  user  options  allow  the 
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designer  to  use  a  type  of  "What  if.."  scenario.  The  numerous  test 
node  suggestions  and  the  interactive  feedback  loop  breaker  make 
STAT  a  suitable  tool  to  aid  in  the  design  of  testable  cCAs.  STAT 
also  provides  an  FOM  in  terms  of  fault  isolation  and  ambiguity 
groups.  Its  only  drawback  is  the  time  consuming  effort  that  is 
required  to  develop  the  dependency  models.  Eut  if  a  facility  to 
model  electronic  components  could  be  established,  the  time  required 
to  develop  the  models  can  be  eliminated. 

XI.  RECOMMENDATIONS 

Based  on  the  previous  discussions  and  observations,  we 
conclude  that,  at  the  present  time,  STAT  would  be  the  most 
appropriate  testability  tool  to  be  used  in  the  design  phase.  It 
possesses  all  the  essentials  to  aid  in  the  design  of  testable  CCAs. 
However,  if  some  improvements  can  be  made  to  CAFIT,  it  can  also 
become  a  powerful  tool  for  DFT.  If  the  latter  is  preferred  because 
it  is  already  government  owned,  the  following  suggestions  are  made 
for  the  improvement  of  cafiT: 

(a)  Provide  an  FOM  in  terms  of  fault  isolation  levels  and 
ambiguity  group  sizes. 

(b)  Provide  information  on  ambiguity  groups  such  as  size  and 
number . 

(c)  Develop  a  model  library  for  analog  components. 

(d)  Allow  data  to  be  entered  easier  if  the  CAE  interface  is  not 
available. 

(e)  Provide  information  on  testing  strategy  or  procedure. 

At  the  present  time,  SMARTCAT,  the  upgrade  to  CAFIT,  was  not 
available  for  our  use.  It  is  not  known  whether  any  of  these 
suggestions  have  already  been  incorporated.  However,  if  the  above 
improvements  are  made,  SMARTCAT  should  be  looked  at  closely  to 
determine  whether  it  is  an  acceptable  tool  for  DFT. 
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ABSTRACT 

Water,  when  mixed  with  or  applied  to  hydrocarbon  fuels, 

substantially  increases  the  rate  of  7aporizc.tion,  with 

significant  decreases  in  boilirg  joints  of  the  fuel 

components,  due  to  azeotropic  "steam  distillation"  effects. 
At  ambient  room  temperatui -s  it  was  found  in  this 

investigation  that  there  is  essentially  no  increase  in 
volatility  (upon  addition  of  water  to  the  fuel) ,  in  terms  of 

reductions  of  flash  points  of  fuel  components,  and  therefore 
there  is  essentially  no  increased  flammability  for  mixtures 
of  fuel  with  water  o*:  ambient  temperatures.  It  was  also 
found,  however,  that  the  increases  in  volatility  for  water- 
fuel  mixtures  can  be  very  pronounced  in  high  temperature 
situations,  as  would  be  observed  in  fully  developed 

hydrocarbon  fuel  fires  which  are  being  extinguished  with 
water  fog,  AFFF,  or  other  water  based  extinguishing  agents  or 
systems.  Moreover,  it  was  found  in  this  investigation  that 
the  increased  volatility  effects  are  particularly  severe  for 

low  volatility  (high  boiling  point)  fuels  such  as  JP-8,  and 
that  can  be  expected  to  pose  correspondingly  severe  problems 

in  fire  fighting  efforts  for  such  low  volatility  fuels. 
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INTRODUCTION 

Water  has  been  used  for  thousands  of  years  as  a  fire 
extinguishing  agent,  either  alone  or  as  the  main  component  of 
a  variety  of  agent  compositions  such  as  AFFF  or  other  water 
based  extinguishing  compositions.  The  greatest  single  effect 
of  the  water  in  such  applications  is  the  great  cooling 
capacity  of  the  water,  which  thus  provides  a  capability  for 
lowering  the  temperature  of  the  burning  fuel  below  its  flash 
point,  thus  removing  one  of  the  four  essential  conditions  for 
maintenance  of  the  fire.  (In  addition  to  heat,  the  other 
essential  bases  of  the  so-called  fire  tetrahedron  are  oxygen, 
the  fuel  itself,  and  the  existence  of  propagating  free 
radical  pathways  in  the  flame  system.)^ 

It  has  been  very  well  established  that  there  are  several 
types  of  situations  in  which  application  of  water  actually 
serves  to  intensify  a  fire. 

1.  When  applied  to  very  hot  oil  or  grease  fires,  water 
will  flash  into  steam,  causing  a  spattering  effect 
which  will  greatly  intensify  the  fire. 

2.  Water  reacts  explosively  with  active  metal  fuels 
such  as  sodium. 

3.  Direction  of  a  strong,  vigorous  jet  of  water  from  a 

fire  hose  into  burning  liquid  fuel  can  result  in 
mechanical  "digging"  effects  which  will  scatter  the 

burning  liquid  over  a  much  wider  area,  causing  a 
serious  increase  in  size  of  the  blaze. 
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4. 


"Boil  over"  can  result  from  formation  of  a  heat  wave 


progressing  downward  through  burning  fuel  floating 
on  water,  finally  reaching  the  water  and  causing 

this  to  come  to  a  rapid  boil  with  forcible  ejection 
of  the  burning  fuel  upward  from  the  surface.  "Boil 
over"  is  observed  only  for  burning  fuel  mixtures 
comprised  of  both  high  and  low  density  components 
(^.e. ,  the  effect  is  not  observed  for  pure  liquids); 
the  fuel  mixture  must  be  floating  on  water;  and  the 

effect  requires  several  hours  of  build-up  time 
before  it  is  observed. 

None  of  the  above  are  in  any  way  anomalous;  none  are 
involved  in  any  way  with  the  type  of  azeotroping  effect  to  be 
described  in  this  report. 

It  is  curious  that  azeotroping  have  thus  far  never  been 
considered  in  any  firefighting  research  efforts,  particularly 
since  it  can  be  shown  to  be  at  least  as  serious  for  fire 
fighting  considerations  as  any  of  these  other  four  effects. 
Azeotropyis  certainly  a  well-known  and  well  described 
phenonmenon,  forming  the  basis  of  what  is  variously  termed 
"steam  distillation",  or  immiscible  phase  azeotropy.  This  is 
a  technique  which  has  been  practiced  on  a  very  large 
industrial  scale,  as  a  means  of  performing  distillations  at 
relatively  low  temperatures  for  what  would  ordinarily  be  very 
low  volatility,  high  boiling  point  liquids.  A  brief 
discription  of  the  principles  of  steam  distillation  is 
provided  below.  (For  a  more  complete  discussion  of  this 
effect,  see  references  [3]  and  [4].) 
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As  shown  in  Figures  1-3  for  benzene,  xylene  and  water, 
the  boiling  point  of  any  liquid  or  mixture  of  liquids  is  that 
temperature  at  which  the  vapor  pressure  of  the  liquid  system 
exactly  equals  the  atmospheric  pressure  (the  standard 
atmospheric  pressure  being  760  mm  Hg)  .  (Benzene  and  xylene 
will  be  discussed  in  detail  in  this  paper,  since  each  has 
boiling  points  which  are  analogous  to  the  boiling  points  of 
volatile  JP-4  and  less  volatile  JP-8  fuels,  respectively.) 

In  Figure  1,  the  boiling  point  of  benzene  is  seen  as 
69"  (176®  F) ;  at  which  temperature  its  vapor  pressure  is 
760  mm  (one  atmosphere) .  In  Figure  2,  xylene  boils  at  139"  C 
(282"  F) ,  with  a  vapor  pressure  of  760  mm;  and  water  has  a 
vapor  pressure  of  one  atmosphere  at  100"  C  (212"  F). 

If  two  liquids  are  immiscible  (insoluble  in  each  other) , 
each  phase  will  exert  its  own  vapor  pressure  at  a  given 
temperature,  and  the  total  pressure  will  then  be  the  s\im  of 
the  vapor  pressures  for  each  liquid  at  that  temperature. 
Figures  4  and  5  shows  the  azeotropic  effects  which  result  in 
significantly  decreased  boiling  points  when  two  immiscible 
(insoluble)  liquids  such  as  benzene  and  water,  or  xylene  and 
water  are  mixed. 

Thus,  in  Figure  4  for  the  insoluble  mixture  of  benzene 
and  water,  at  69"  C  (156"  F)  benzene's  vapor  pressure  is  533 
mm  Hg,  and  water  has  a  vapor  pressure  of  227  mm  Hg.  Since 
the  total  pressure  is  760  mm,  the  mixture  will  boil  at  this 
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lowered  temperature,  some  11°  C  (52°  F)  lower  than  the 
boiling  point  of  pure  benzene. 

The  effect  becomes  even  more  pronounced  for  higher 
boiling  boint  liquids,  as  can  be  seen  in  Figure  5  for  the 
insoluble  mixture  of  water  and  xylene.  At  94.5°  C  (202°  F) , 
the  vapor  pressures  of  xylene  and  water  total  one  atmosphere 
—  some  45°  C  (80°  F)  cooler  than  for  xylene  alone. 

(Figures  4  and  5  also  indicate  another  term,  azeotropic 
overpressure;  this  important  feature  will  be  discussed  in 
detail  later  in  this  report.) 

All  of  these  azeotropic  considerations  can  have  serious 
implications  for  the  flammability  of  hydrocarbon  fuels  in 
contact  with  water  —  in  the  case  when  fuel  fires 
are  being  extinguished  by  water  or  water  based  extinguishing 
agents  such  as  AFFF. 

Fuel  flammability,  and  intensity  of  fire  for  the  fuel, 
is  typically  regarded  in  terms  of  the  fuel's  flash  point  — 
_i.e.,  temperature  of  the  liquid  at  which  its  vapors  are 
sufficiently  present  over  the  fuel  to  sustain  a  fire.^  The 
more  flammable  fuels  are  those  with  the  lower  flash  points, 
and  fuels  with  higher  flash  points  are  typically  regarded  as 
being  more  safe  from  the  standpoint  of  such  ignitions. 


22-7 


8 


Due  to  these  volatility  considerations,  aviation  fuels 
have  undergone  dramatic  changes  since  World  War  II.  In  1951 
the  US  Air  Force  and  Army  changed  from  a  highly  vol  ile 
blend  of  gasoline  and  kerosene  to  the  less  volatile  JP-4 
formulation  still  widely  used  by  these  services  today.  In 
1952  the  US  Navy  adopted  a  much  less  volatile  blend  (JP-5)  as 
a  result  of  the  extreme  fire  fighting  constraints  peculiar  to 
Navy  carrier  operations.  In  1958  an  intermediate  blend  (less 
volatile  than  JP-4,  but  more  volatile  than  JP-5)  was  adopted 
as  Jet  A-1  fuel  for  use  in  commercial  aviation;  and  since 
1968  a  slightly  modified  version  of  Jet  A-1,  designated  as 
JP-8,  has  been  gradually  implemented  for  use  by  NATO  and 
USAFE  aircraft.  The  characteristics  and  compositions  of 
these  principal  JP  fuels  are  shown  in  Tables  I  and  II. 

TABLE  I.  FUEL  CHARACTERISTICS'"^^ 


US 

JP-4 

JP-5 

JP-8  Jet  h 

UK 

AVTAG 

AVTUR 

NATO 

F-40 

F-4  4 

F-34  F-35 

Specific  Gravity 

0.77 

0.83 

0.80 

Vapor  Pressure,  psi  [RT] 

3.0 

LESS 

THAN  0.1  PSI 

Flash  Point  (Fahrenheit) 

-20 

+  150 

+  125 
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Cirnn  NumMf 


Ciftoi 


TABLE  II,  DISTRIBUTION  OF 
HYDROCARBONS  IN  JP-4 ,  JP-5  AND 
JP-8,  DETERMINED  BY  SIMULATED 
DISTILLATIONS^ 


azeotropic  and  flash  point 


In  Table  III  are  presented 

data  for  Cg  through  Cg  hydrocarbons  —  these  being  prominen 
constituents  of  the  relatively  volatile  JP-4  fuel  widely  use 
in  the  Air  Force  today.  It  can  be  noted; 
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AZEOTROPIC  DATA  FOR  HYDROCARBON  MIXTURES  WITH  WATER 
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1.  There  is  a  significant  increase  in  both  boiling 
points  and  flash  points  of  the  pure  hydrocarbons, 
with  increasing  numbers  of  carbons  in  the  molecules 
(_i  .e. ,  with  increasing  molecular  weight)  . 

2.  Boiling  points  significantly  decrease  for  each  of 
these  hydrocarbons  when  mixed  with  water. 

3 .  There  are  bigger  and  bigger  decreases  in  azeotropic 

boiling  points  of  the  hydrocarbon/water  mixtures , 
with  increasing  numbers  of  carbons  in  these 

hydrocarbons .  (For  C  hydrocarbons,  reductions  in 
boiling  points  for  the  hydrocarbon/water  mixtures 

exceed  100°  F,  with  most  of  the  vapor  comprised  of 
the  hydrocarbon  fuel  molecules  1) 

The  matter  of  "azeotropic  overpressures",  referred  to  on 
page  7  in  the  discussion  of  Figures  4  and  5,  was  early 
regarded  in  this  work  as  being  an  area  of  prime  concern. 

In  a  liquid  fuel  fire,  the  surface  of  the  fire  the 
burning  liquid  is  at  its  boiling  point.  Although  there  will 
be  a  temperature  gradient  in  the  liquid  fuel  below  the 
surface,  there  will  be  a  signficant  fraction  of  the  liquid 
fuel  beneath  the  burning  surface  which  will  be  at  a 
considerably  elevated  temperature. 

If  a  water-based  extinguishing  agent  (e.^. ,  fog,  AFFF, 
or  even  a  solid  stream)  is  applied  to  this  fire,  the  incoming 
water  will  also  be  significantly  heated  as  it  passes  through 
the  flame  and  into  the  burning  liquid. 
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We  will  first  examine  a  high  volatility  fuel  such  as 
benzene  (see  Figure  4)  ,  with  a  volatility  representative  of 
the  major  components  of  JP-4.  If  the  benzene  is  heated  to 
its  boiling  point  (80°  C,  or  176°  F)  ,  and  water  added  at  a 
rate  such  as  to  allow  it  to  be  heated  to  about  this  same 
temperature,  the  vapor  pressures  of  the  water  (357  mm)  and 
of  benzene  (760  mm)  total  now  to  1117  mm.  This  is  an 
overpressure  of  357  mm  (about  J  atmosphere)  in  vapor  pressure 
which  has  suddenly  been  installed  in  what  had  been  a  gently 
boiling  liquid.  The  effect  will  be  similar  to  that  which  we 
would  observe  if  we  heated  the  benzene  to  92°  C  (198°  F)  in  a 
closed  pressure  cooker,  which  would  now  show  a  pressure  of 
about  7  psi  or  i  atmosphere  on  its  dial.  If  we  suddenly  open 
the  pressure  cooker,  the  contents  will  erupt  in  very  vigorous 
boiling.  This  same  effect  will  be  observed  on  addition  of 
the  water  to  the  boiling  benzene  (or  JP-4  type  of  fuel) ;  and 
a  somewhat  (though  perhaps  not  very  greatly  increased 
intensity  in  the  fire  will  be  observed. 

Figure  5  illustrates  a  low  volatility  fuel  such  as 
xylene  with  a  volatility  representative  of  major  components 
of  JP-8.  If  xylene  is  heated  to  its  boiling  point  (139°  C, 
282°  F)  ,  and  water  added  at  a  rate  to  allow  it  to  be  heated 
to  about  the  same  temperature,  the  vapor  pressures  of  water 
(2,640  mm)  and  of  benzene  (760  mm)  total  now  to  3,400  mm  — 
an  overpressure  of  2,640  mm  or  about  3i  atmosphere  in  vapor 
pressure,  again,  suddenly  unleased  in  what  had  been  a  gently 
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boiling  liquid.  The  effect  is  similar  to  that  which  we  would 
observe  if  we  heated  the  xylene  to  200“  C  (about  400“  F)  in  a 
closed  pressure  cooker,  which  would  now  show  a  pressure  of 
about  52  psi  or  3J  atmosphere  on  its  dial.  If  we  suddenly 
open  the  pressure  cooker,  the  contents  will  erupt  in  very 
violent  boiling.  This  same  effect  will  be  observed  on 
addition  of  water  to  boiling  xylene  (or  JP-8  type  of  fuel) ; 
and  an  extremely  greatly  pronounced  increased  intensity  in 
the  fire  will  be  observed. 


The  questions  then  arose: 

1.  Since  both  boiling  points  and  flash  points  of  the 
pure  hydrocarbons  are  both  linked  with  the 

molecular  weight  of  the  fuel  components;  and  since 
there  is  an  obviously  increased  volatility  of  the 
hydrocarbon  when  water  is  present,  is  there  a 
corresponding  decrease  in  the  flash  points  of  these 
hydrocarbons,  when  mixed  with  v^ater? 

2.  Is  there  a  correspondingly  increased  reduction  in 

flash  points  of  higher  molecular  weight  hydrocarbons 
as  are  found  in  JP-5  and  JP-8  fuels,  ordinarily 

considered  to  be  less  volatile  and  thus  more  fire 
safe  than  the  more  volatile  JP-4  types  of  fuels? 

3.  What  kinds  of  fire  fighting  situations  would  most 
contribute  to  unexpected  increases  of  vaporization 
of  the  fuel,  during  extinguishing  operations?  (If 
there  is  an  inordinate  increase  in  such 
vaporization,  it  might  be  anticipated  that  there 
could  be  concomitant  increases  in  flash  back. 
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fireballing  and  similar  unexpected  flame  flare-ups. 
Such  situations  could  be  particularly  haz?  dous  for 
large  scale  fire  fighting  operations.) 


PROJECT  OBJECTIVES 

The  overall  objective  of  this  project  were  to 
investigate  the  possible  effect  of  water  to  cause  jet  fuels 
and  similar  hydrocarbon  fuel  compositiors  to  burn  faster  and 
with  greater  instensities  when  the  water  is  applied  to  the 
burning  fuel,  as  a  result  of  steam  distillation  effects. 

In  the  first  phase  of  work  flash  points  were  determined 
for  representative  volatile  and  non-volatile  hydr  carbons , 
both  in  the  dry  state  and  when  mixed  vith  water. 

In  the  second  phase  of  the  projects,  effects  on  fire 
proclivities  were  studied,  involving  application  of  water  to 
boiling  hot  hydrocarbon  fuels  of  both  high  and  low 
volatilities  (as  wou.Td  be  representative  for  JP-4  and  J?-8 
types  of  aviation  taels,  respectively) . 

EXPERIMENTAL  WORK  PART  I.  FLASH  POINT  DETERMINATIONS. 

A  Herzog^^  semiautomatic  electric  Pensky-Mai. tens  closed 

1  7 

cup  electric  flash  point  tester*  (designed  for  determination 
of  flash  points  of  mixtures  of  multi-phase  systems,  in 
accordance  with  ASTM  D56  and  D93  procedures)  ,  and  a  Herzog^ 
semiautomatic  electric  Cleveland  open  cup  flash  point 
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tester  (designed  for  determination  of  flash  points  in 
accordance  with  ASTM  D92  procedures)  were  used  for; 

1.  Determining  flash  points  of  representative 
hydrocarbons  of  low  and  high  molecular  weight,  of 
known  flash  point,  by  way  of  providing  a 
standardization  for  the  equipment. 

2.  Determine  flash  points  of  a  variety  of  straight 
chain  and  branched  chain  alkanes,  to  ascertain 
possible  structural  effects  which  have  not  yet  been 
examined  with  regard  to  flash  points. 

3.  Determine  flash  points  of  variety  of  high  and  low 
molecular  weight  hydrocarbons  mixed  with  water,  to 
determine  possible  existence  of  steam  distillation 
effects  on  flash  points. 

4.  Attempt  a  quantification  of  such  steam  distillation 
effects,  if  any,  for  extrapolation  to  other 
hydrocarbon  fuel  components  and  mixtures. 

EXPERIMENTAL  RESULTS,  PART  I:  FLASH  POINT  DETERMINATIONS 
FOR  HYDROCARBONS  WITH  AND  WITHOUT  AZEOTROPIC  WATER  EFFECTS 


The  following  observations  were  made: 
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1.  Excellent  agreement  was  achieved  for  all  the  many 

standard  hydrocarbons  using  the  open  cup  flash  point 
tester,  and  for  the  few  values  available  for  closed 
cup  determinations. 

2.  ^  reductions  in  flash  points  were  observed  for  any 
of  the  standard  hydrocarbons  which  were  tested 
floating  on  water  in  the  open  cup  tester. 

As  a  corollary  to  this  observation,  it  was  also 
observed  that  water  can  be  a  very  beneficial 

additive  to  immiscible  (insoluble)  fuels  for 
flash  point  determinations,  at  least  for  fuels 

testing  at  flash  points  of  80“  C  or  lower,  and 
for  the  few  with  higher  flash  points,  probably 

extending  even  higher  than  this  value.  Thus, 
heating  is  facilitated  using  the  water.  Nowhere 

nearly  as  much  fuel  is  required  —  only  a  thin 
film  will  suffice.  The  apparatus  is  much  easier 
to  clean  after  each  determination.  Accuracy  is 
unimpaired  —  the  same  readings  are  achieved 

with  or  without  the  water. 

3.  For  flash  point  determinations  for  hydrocarbon  on 
water,  using  the  Pensky-Martens  closed  cup  electric 
flash  point  tester,  there  were  frequently  no  flash 
points  observed.  It  was  observed  on  such  occasions 
that  the  build-up  of  water  vapor  served  to  occlude 
air  from  the  closed  cup,  with  the  result  that 
ignition  was  impaired  or  obviated. 
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EXPERIMENTAL  RESULTS,  PART  II: 

APPLICATION  OF  WATER  TO  BOILING  HOT  HYDROCARBON  FUELS  OF 
HIGH  AND  LOW  VOLATILITIES  (REPRESENTATIVE  OF  JP-4  AND  JP-8) 


I 


» 


» 


For  these  determinations,  an  apparatus  was  constructed 
in  accordance  with  the  design  indicated  in  Figure  6. 


Figure  6.  Ignition  Flask 

A.  500-ml  round  bottom,  with 

B.  Two-necked  Claisen  head, 

C.  Pot  thermometer  well, 

D.  And  side  arm  extending 
from  Claisen  head. 

E.  Electric  heating  mantle. 

F.  Magnetic  stirrer. 

G.  Addition  funnel  for 
adding  water. 

H.  Capillary  extension  tube 
from  side  arm  tube. 

I.  Head,  pot  temperature 
thermometers . 

J.  Igniter 


Not  shown:  Aluminum  foil  insulation  around  assembly;  nitrogen 
tank  for  purging  air  from  assembly;  heating  tape 
for  side  arm  tube;  emergency  fire  extinguishers 


Procedures : 

1.  Add  100  ml  fuel  to  flask,  with  magnetic  stirring  bar. 

2.  Purge  air  from  assembly  with  nitrogen  tank. 

3.  Set  heating  mantle  and  heating  tape  to  about  20®  C  above 

boiling  point  of  fuel. 

4.  When  fuel  begins  to  distill  from  capillary  extension, 
light  distillate  with  igniter. 

5.  Add  1  ml  water  from  addition  funnel,  observe  flame  growth 
(if  any)  at  capillary  extension. 
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RESULTS  OF  EXPERIMENTS  WITH  XYLENE,  AND  XYLENE  AND  WATER; 

AND  WITH  BENZENE,  AND  BENZENE  AND  WATER 

1.  For  both  xylene  and  benzene  experiments,  distillation 
rates  were  achieved  (prior  to  adding  water)  which 
provided  just  enough  fuel  at  the  capillary  to  sustain  a 
small  flame  at  the  capillary  extension  tube. 

2.  For  distilling  benzene,  addition  of  water  did  not 
materially  increase  the  magnitude  of  the  flame. 


3.  For  distilling  xylene,  addition  of  water  resulted  in  a 
huge  increase  in  flame  size;  see  Figures  7  and  8  below. 


Figure  7.  Xylene  flame  Figure  8.  Xylene  flame  is 

prior  to  greatly  intensified 

adding  water.  by  adding  of  water. 


22-18 


19 


RAMIFICATIONS  OF  WATER  AZEOTROPING  EFFECTS  FOR  JP-8,  JP-5 
AND  JET  A-1  (AND  SIMILAR  LOW  VOLATILITY  HIGH  BOILING 
POINT  FUEL)  FIREFIGHTING  CONSIDERATIONS 

1.  Operational  firefighting 

No  previous  attention  appears  to  have  been  directed  to 
the  possibility  of  increased  flammability  hazards  arising 
from  azeotroping  effects  from  application  of  water  systems  to 
hydrocarbon  fuel  fires.  Statements  to  the  contrary  have  been 
encountered  in  responsible  fire  manuals: 

"  ...  water  . . .  entrained  in  fuel  ...  is  not  particularly 
significant  from  a  fire  hazard  viewpoint  ...  This 

is  valid  for  firefighting  implications  for  the  more  volatile 
JP-4  type  fuels;  but  it  is  an  over-simplification  from  the 
standpoint  of  highly  possible  increases  in  volatilities  for 
less  volatile  fuel  blends  such  as  JP-8,  JP-5  and  Jet-Al. 

There  is  at  least  one  instance  in  the  fire  fighting 
literature  which  can  now  possibly  be  reinterpreted  in  the 
light  of  a  possible  water/ fuel  azeotroping  effect. 

On  26  May  1981  an  EA-6B  crashed  into  several  F-14's  while 
landing  on  the  US  Navy  carrier  NIMITZ  (CVAN  68)  .  In  the 
ensuing  fire  14  men  were  killed  and  42  injured,  with  $60 
million  damages  to  the  carrier  and  its  planes.  Fire  fighting 
efforts  commenced  immediately,  using  water  hoses  and  AFFF 
washdown  systems  (although  the  AFFF  systems  were  not  deployed 
until  well  into  the  fire  fighting  effort)  .  In  a  subsequent 
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investigation  it  was  suggested  that  possibly  there  had  been 
contamination  of  JP-5  fuel  in  the  Navy  aircraft  by  JP-4  fuel 
as  a  result  of  refuelling  from  an  Air  Force  tanker;  and  that 
there  had  been  a  reduction  in  flash  point  of  the  Navy  jet 
fuel  as  a  result  of  the  possible  admixture  with  the  more 
volatile  JP-4.^^ 

A  possibility  that  greatly  increased  volatilization 
occurred  when  the  water  based  extinguishing  agents  (fog  or 
AFFF)  contacted  the  hot  fuel,  should  be  investigated  from  the 
standpoint  of  future  fire  fighting  technologies.  (It  should 
be  noted  that  Halon  extinguishing  agents  may  be  unavailable 
in  the  future,  with  an  increased  reliance  on  water  based 
extinguishing  systems.  From  the  standpoint  of  Air  Force 
interests,  if  there  is  a  conversion  from  more  volatile  JP-4 
fuel  stocks  to  less  volatile  JP-8  fuel;  and  if  JP-8  actually 
turns  out  to  be  prone  to  unanticipated  increased  vaporization 
rates  in  the  presence  of  water,  due  to  azeotropic  effects, 
the  need  for  an  in  depth  evaluation  of  this  effect  would 
assume  even  greater  dimensions  of  urgency.  It  should  also  be 
noted  that  the  Navy  is  now  using  low-volatility  JP-5  fuel, 
and  that  commercial  aircraft  are  now  exclusively  fueled  with 
low  volatility  Jet  A-1  [essentially  identical  to  JP-8]. 
Thus,  need  can  be  established  for  examination  of  the 
azeotroping  effects  from  the  standpoint  of  Navy  and 
commercial  aviation  interests,  as  well.] 


22-20 


21 


In  summary,  the  following  implications  pertain  for 
azeotropic  water  effects  in  operational  firefighting 
considerations : 

(1)  Application  of  water  onto  burning  fuels  insoluble 
in  water  will  result  in  an  increase  rate  of 
volatilization  of  the  fuel,  and  a  correspondingly 
increased  fire  intensity  will  result. 

(2)  The  effect  is  particularly  pronounced  for  less 
volatile  "fire-safe"  fuels  such  as  JP-8,  JP-5  and 
Jet  A-1.  (See  Figures  9  and  10.) 

(3)  Due  to  unexpected  high  increases  in  rates  of 
volatilization  which  can  result  with  low  volatility 
fuels  on  application  of  AAAF,  water  fog  or  other 
water-based  firefighting  agent,  it  may  be  best  to 
use  halon  or  alternative  halons  for  supplentary 
extinguishment . 

(4)  A  need  exists  for  increased  firefighter  awareness 
of  unexpectedly  high  increases  in  rates  of 
volatilization  for  low  volatility  fuel  fires,  when 
using  water-based  extinguishing  agents. 

(5)  Water  suspended  in  the  fuel  before  the  fire  will 
not  materially  affect  the  flash  point. 
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Figure  9.  Water  used  in  extinguishment  of  low  boiling  point 
(high  volatility)  fuel  fires:  anomalous  effect. 

(E.q. ,  JP-4  fires. ) 


Hilt 


lUfiir 


.V 


4 


ik 


1l‘l  1 


!  '7 


? 

I 


Figure  10.  Water  used  in  extinguishment  of  high  boiling 
point  (low  volatility)  fuel  fires:  increased  fire 
intensities!  (E.^. ,  JP~8,  JP-5,  Jet  A-1  fires.) 
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2.  Firefighting  Training  Considerations 

In  typical  firefighting  training  exercises,  a  large  fire 
pit  is  partially  filled  with  water  to  provide  a  flat  surface 
for  fuel  which  is  then  layered  to  a  depth  of  an  inch  or  so 
over  the  water  (the  flat  water  surface  minimizes  fuel  volume 
requirements)  .  JP-4  is  typically  used  for  CONUS  Air  Force 
training  requirements;  JP-8  for  USAFE  training;  Jet  A-1  for 
commercial  aviation  training  requirements,  and  JP-5  for  Navy 
training) .  A  diagram  of  a  typical  fire  pit  assembly  is  shown 
in  Figure  11. 

As  can  be  seen  in  Figure  11,  there  is  therefore  a  very 
sharp  temperature  gradient  in  the  very  thin  layer  of  burning 
liquid  fuel.  At  the  utmost  surface,  the  fuel  temperature 
will  be  at  its  boiling  point  (i* e. ,  139°  C  or  282°  F  for 
xylene);  but  an  inch  or  so  below  this,  at  the  interface  of 
the  fuel  layer  with  the  underlying  water,  the  temperature 
will  have  dropped  to  the  ambient  water  temperature  (typically 
no  more  than  90°  F) .  Therefore,  almost  all  of  the  fuel  will 
be  at  a  temperature  which  is  far  below  its  azeotropic  boiling 
point  (in  the  case  of  xylene,  203°  F) . 

Thus,  for  even  the  most  non-volatile  hydrocarbon  fuel 
such  as  JP-5,  there  will  be  no  observed  increase  in  rate  of 
volatilization  o  f  the  burning  fuel  when  water-based 
extinguishing  agents  are  applied  to  the  firel 
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It  is  therefore  reconunended  that  firefighting  tests  be 
conducted  with  water  fog  and  AFFF  on  JP-8  fires  in  a  100- foot 
fire  put  with  water  layer  present.  The  pit  should  contain 
three-dimensional  metal  structures  above  the  fuel  level,  to 
provide  real-life  hot  spots  for  heating  incoming  water. 

It  is  also  recommended  that  existing  film  and  video 
coverage  on  JP-8/ JP-5/ Jet  A-1  fires  be  studied  to  ascertain 
the  extent  of  azeotropic  stecun  distillation  effects  which  may 
have  been  experienced  in  previous  commercial  and  military 
aircraft  or  POL  fires. 
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ABSTRACT 


Physiological  and  behavioral  adjustments  to  environmental  stressors  can  provide 
information  on  the  well-being  of  animals,  in  a  study  of  the  physiological  effects  of  jet 
aircraft  noise,  adult,  mixed  breed,  domestic  male  and  female  goats  were  allowed  to  adjust 
to  pens  and  blood  sampling  procedures  prior  to  measuring  the  stressor  effects  on  cardiac 
frequency,  piasma  cortisoi,  and  white  blood  cell  counts.  The  single-event  and  multiple- 
event  stimuli  were  provided  by  a  noise  simulation  system  that  exposed  animals  to  noise 
intensities  that  nominally  would  be  encountered  during  jet  aircraft  operations.  Goats 
were  chosen  as  animal  models  because  they  are  easily  managed  and  maintained,  readily 
available,  comparatively  inexpensive,  ruminants,  and  ideally  suited  as  a  wildlife  analog. 
As  a  result  of  the  study  reported  here,  the  usefulness  of  a  locally  fabricated  radio 
transmitter  as  a  non-invasive  technique  to  evaluate  heart  rates  of  dometic  goats  was 
successfully  demonstrated  through  comparison  with  data  obtained  from  commercially 
available  heart  rate  monitors.  Although  behavioral  responses  to  noise  stimuli  were 
observed,  heart  rate  responses  revealed  no  information  that  was  not  apparent  from 
changes  in  behavior.  Heart  rates  were  increased  55-93%  in  noise-exposed  compared  to 
control  goats.  Also,  an  indirect,  but  non-significant,  correlation  between  the  noise 
exposure  interval  and  net  percent  increase  in  heart  rate  was  observed.  Attempts  to 
identify  heart  rate  as  a  predictor  of  plasma  cortisol  were  unsuccessful.  Also,  no 
statistically  significant  changes  in  plasma  cortisol,  lymphocytes,  or  neutrophils  were 
detected  in  goats  following  exposure  to  single-event  or  multiple-event  noise  intensities. 
Nonetheless,  an  overall  assessment  of  the  results  suggests  that  additional  studies  on  the 
effects  of  noise  on  domestic  goats  are  needed. 
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INTRODUCTION: 


I . 


Since  the  introduction  of  jet  aircraft  into  its  inventory,  the  U.  S.  Air  Force 
(USAF)  has  had  to  deal  with  complaints  from  the  public  regarding  subsonic  and 
supersonic  noise.  Public  concern  over  noise  pollution  has  been  exemplified  by 
campaigns  to  restrict  flights  of  military  aircraft  at  supersonic  speeds  over  areas  of 
high  human  population  (Ewbank,  1977).  The  result  is  that  many  of  those  flights  now 
are  over  the  sea  or  other  areas  of  low  human  density  (e.g.,  National  Parks,  wildlife 
conservation  areas). 

In  the  1960-70S,  the  focus  of  many  noise  studies  was  on  occasional  booms  which 
did  not  appear  to  have  much  effect  on  an  animal's  behavior,  except  for  "startle"  response 
(Bell,  1972).  All  species  studied  (e.g.,  cattle,  sheep,  horses,  pigs,  poultry)  appeared  to 
physiologically  adjust  to  sonic  booms  (Casady  and  Lehman,  1967;  Espmark  glaL  1974; 
Ewbank,  1977;  Shotton,  1982)  and  subsonic  flight  (Cotterau,  1978)  with  no  loss  of 
productivity.  In  general,  behavioral  responses  were  minimal  except  for  the  avian 
species.  For  the  latter,  animals  "scatter"  or  "crowd"  (Ewbank,  1977)  or  tend  to  exhibit 
an  "alert"  reaction  and  "movement"  away  from  the  source  of  the  sound  (Bond,  1971). 
Sonic  booms  cause  a  "startle"  response  in  cattle  and  sheep  but  this  decreases  as  the 
duration  of  the  exposure  increases;  sheep  appear  to  be  more  disturbed  by  sonic  booms 
when  they  are  standing  compared  to  lying  (Espmark  filal,  1974).  Little  information  is 
available  on  the  effects  of  noise  from  subsonic  flights  or  sonic  booms  on  goats. 

Research  trials  conducted  at  Tyndall  AFB,  Florida,  in  the  Summer  of  1989 
assessed  the  response  of  domestic  goats  to  noise  from  low-flying  jet  aircraft.  These  field 
studies  were  part  of  the  UES  Summer  Faculty  Research  Program  (SFRP)/Graduate 
Student  Research  Program  (GSRP).  The  Final  Reports  of  the  Summer  Fellow  (Besch, 
1989)  and  Graduate  Student  Researcher  (Zern,  1989)  described  the  difficulties 
associated  with  conducting  field  research.  In  particular,  it  was  very  difficult  to  control 
variables  (e.g.,  weather,  noise  intensity  and  duration,  feed  and  water  intake)  which  can 
modify  animal  function  and  response  to  the  noise  stressor.  Wh«!e  there  is  evidence  to 
suggest  that  some  of  the  goats  were  physiologically  affected  by  the  jet  aircraft  noise, 
others  displayed  responses  that  appeared  to  be  unrelated  to  the  noise  stressor. 

Although  there  were  no  significant  differences  between  the  baseline  heart  rates 
between  the  two  groups  of  TAFB  goats,  the  goats  located  near  the  jet  aircraft  runway 
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(Group  1)  displayed  a  decreasing  heart  rate  over  time  which  was  consistent  with  their 
behavior.  The  control  group  (Group  2)  of  goats  whose  pens  were  in  a  semi-isolated 
location  on  the  base,  maintained  an  elevated  heart  rate  and  never  seemed  to  adjust  to 
their  new  environment,  displayed  apprehensiveness,  proved  difficult  to  manage,  and 
exhibited  an  uneasiness  at  the  sight  of  people.  Nevertheless,  only  the  Group  1  goats  were 
exposed  to  elevated  noise  levels  associated  with  takeoffs  and  landings  of  jet  aircraft,  but 
both  groups  displayed  hematological  changes  (e.g.,  relative  lymphopenia  and 
neutrophilia)  that  are  considered  to  be  adrenocortical-mediated  stressor  responses. 

From  the  above  observations  it  is  difficult  to  separate  the  physiological  and 
behavioral  responses  due  to  the  jet  aircraft  noise  stressor  from  the  responses  due  to 
other  factors  such  as  difference  in  site  location  (e.g..  Group  2  animals  may  have  been  too 
isolated  from  humans  or  too  close  to  the  track  or  trail  of  predator  animals  such  as  a 
coyote).  It  also  is  possible  that  the  observed  changes  in  the  goats  represent  a  summation 
of  responses  due  to  simultaneous  exposure  to  multiple  stressors  (e.g.,  noise,  isolation, 
new  surroundings).  One  way  to  deal  with  the  latter  is  to  perform  experiments  under 
laboratory  conditions  where  all  variabies-including  the  experimental-can  be 
properly  controlled.  In  this  way,  the  relationship  between  one  experimental  variable 
(i.e.,  noise  intensity)  and  the  goat's  response  (i.e.,  physiological  strain)  could  be 
separated  from  other  potential  stressor  responses.  This  was  the  basis  for  the  proposal 
that  led  to  the  research  reported  here. 

1 1 .  OBJECTIVES 

A.  GOALOF  RESEARCH 

The  primary  goal  of  this  research  was  to  determine  the  effects  of  simulated  jet 
aircraft  noise  on  the  physiological  well-being  of  domestic  goats.  The  goat  was  chosen  as 
the  animal  model  because  it  is 

A.  An  ubiquitous  domestic  animal. 

B.  Easily  managed  and  maintained. 

C  Readily  available. 

0.  Comparatively  inexpensive. 

E  A  ruminant. 

F.  Ideally  suited  as  a  wildlife  analog. 
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B.  SPECIFIC  OBJECTIVES 


To  attain  the  research  goal,  the  experimental  plan  was  subdivided  into  four 
specific  research  objectives  as  follows: 

A.  VAUDATION  OF  A  HEART  RATE  RADIO  TRANSMITTER 

B.  HEART  RATE  AS  A  PREDICTOR  OF  PLASMA  CORTISOL  LEVELS 

a  BEHAVIORAL  RESPONSES  OF  GOATS  TO  SIMULATED  JET  AIRCRAFT  NOISE 
D.  PHYSIOLOGICAL  RESPONSES  OF  GOATS  TO  SIMULATED  JET  AIRCRAFT  NOISE 

C  ANIMAL  CARE  AND  USE  APPROVAL 

The  goats  used  in  the  research  described  here  were  obtained  from  and  health  care 
was  provided  by  the  University  of  Florida,  Health  Center  Animal  Resources  Unit 
(HCARU).  Animal  care  and  use  procedures  were  in  accordance  with  national  standards  as 
described  In  the  Guide  for  the  Care  and  Use  of  Laboratory  Animals  (ILAR,  1985)  and 
Guide  for  the  Cafe...and  Use.  Qf  Agricultural  Animals  in  Agricultural  Research  and 
Teaching.  (Anonymous,  1988).  The  research  plan  and  protocols  were  reviewed  and 
approved  by  the  University  of  Florida  Animal  Care  and  Use  Committee  prior  to  initiation 
of  animal  experimentation. 

III.  OBJECTIVE  NO.  1:  VALIDATION  OF  A  HEART  RATE  RADIO  TRANSMITTER 
A  INTRODUCTICX^J 


Heart  rate  has  been  used  as  an  indicator  of  stressor  response  because  it  is 
correlated  with  energy  expenditures  (Richards  and  Lawrence,  1984)  and  is  a  sensitive 
measure  of  arousal  (MacArthur  £l^  1982).  It  also  has  been  reported  (Harlow  filal, 
1 987)  that  remote  monitoring  of  cardiac  frequency  can  be  used  as  a  predictor  of  adrenal 
function  and,  therefore,  the  potential  immunologic  condition  ^f  an  animal  during 
exposure  to  an  environmental  stressor. 

But,  commercially  available  radiotelemetry  and  electronic  devices  for  measuring 
heart  rate  are  expensive.  Further,  radiotelemetry  requires  a  signal  receiver  unique  to 
the  transmitter  and  duration  of  use  is  limited  by  the  operating  life  of  the  transmitter's 
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battery  while  electronic  devices  are  limited  by  the  operational  life  of  the  audio  cassette 
on  which  heart  rate  is  recorded.  Also,  radio  transmitters  require  surgical  implantation 
which  exposes  the  animal  to  some  trauma  and  necessitates  recovery  time. 

These  objections  could  be  overcome  with  a  reliable,  extended-use,  and  'externally 
mounted  radiotelemetry  device  for  measuring  heart  rates  in  unrestrained  ar  •  .  s.  A 
radio  transmitter  which  fulfilled  these  requirements  was  designed,  fabricated,  ai'id  used 
to  measure  heart  rates  in  unrestrained  domestic  goats.  Heart  rate  data  were  compared  to 
those  simultaneously  obtained  from  the  same  animal  using  a  commercially  available  and 
externally  mounted  electronic  heart  rate  monitor.  The  results  of  those  comparisons, 
together  with  a  description  of  the  heart  rate  transmitter,  are  reported  here. 

B.  MATERIALS  AND  METHODS 

Animals:  Four  mixed-breed  male  and  female  goats  with  a  body  mass  of  22.4  ±  1 .0kg 
(Mean  ±  SE)  and  between  6  and  13  months  of  age  were  used  in  this  study.  Of  these,  1 
male  and  1  female  came  from  one  subgroup  of  6  goats  and  1  male  and  1  female  came  from 
another  subgroup  of  7  goats. 

Animal  environment:  Each  subgroup  of  goats  was  housed  in  a  separate  enclosed  pen  made 
from  converted  horse  stalls  which  measured  about  3.7m  in  length  and  7.6m  in  width. 
Thus,  each  goat  was  provided  at  least  4.7m2  of  pen  space.  The  pens  contained  cathedral 
ceilings  and  were  ventilated  with  outside  air  through  3  haif-doors  (upper  half  of  door 
always  remained  open)  and  thermostatically-controlled  window-mounted  exhaust  fans 
provided  air  movement.  Air  temperature  in  each  pen  was  monitored  with  thermistors 
(Omega,  Type  T,  Stamford,  CT)  placed  in  three  different  locations,  1 .5m  above  the  floor. 
Air  temperature  from  each  thermistor  was  recorded  using  a  data  acquisition  system 
(Model  2200B,  John  Fluke  Mfg  Co..  Everett,  WA)  every  60  minutes.  There  were  no 
statistically  significant  (P>0.05)  differences  between  goat  pen  temperatures  which 
cycled  daily  between  21 .1  ±  0.9  (Mean  ±  SE)  and  33.0  ±.  O.S^C  (Control)  and  22.2  ± 
0.9  and  32.5  ±  0.6^C  (Experimental)  during  each  3.5  wk  data  collection  period. 

Feed  and  Water:  Each  group  of  goats  was  fed  (0.34kg/goat/da)  goat  chow  (Purina 

Mills,  Inc,  Goat  Chow®)  and  hay  each  morning.  All  goat  chow  was  weighed  on  a 
temperature  compensated  spring  scale  (Chatillon,  Type  027).  Water  was  changed  each 
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morning  (7:30-8:30  am)  and  evening  (5:30-6:30  pm)  and  was  provided  to  the  goats  ad 

libitum. 


Telemetry  System:  The  heart  rate  telemetry  system  (Figure  III.1)  was  a  modification 
of  the  one  used  to  measure  body  temperature  (Varosi.  alai.  1990).  The  modifications 
included  the  addition  of  a  lowpass  filter  (LPF),  a  one-shot  trigger  circuit,  and  a  newly 
designed  serial  interface.  The  RF  signal  from  the  heart  rate  transmitter  was  received  by 
an  antenna  and  amplified,  as  necessary,  for  long  coaxial  cable  transmissions.  The 
antenna  was  a  RG59/U  female  connector  with  a  1-2m  center  wire  and  two  1-2m  ground 
plane  wires  soldered  to  it.  The  signal  from  the  antenna  was  fed  into  a  programmable 
receiver/scanner  (PRO-2004,  Radio  Shack,  Ft.  Worth,  TX).  The  received/scanner  had 
pre-programmed  channels  for  the  unique  frequencies  of  the  individual  radio 
transmitters.  The  audio  output  of  the  receiver/scanner  was  fed  to  a  lowpass  filter  and 
one-shot  trigger  circuit  which  produced  a  pulse  waveform  that  can  be  counted  by  .  e 
frequency  counter  (Model  1910A,  John  Fluke  Mfg  Co.,  Everett,  WA).  A  serial  interface 
(fabricated  in  the  laboratory)  converted  the  digital  output  of  the  freque.ncy  counter  to  a 
RS-232  serial  port  for  interfacing  with  an  IBM-compatible  personal  computer  (PS/2 
Model  50,  IBM,  Boca  Raton,  FL)  where  it  was  recorded  each  minute  and  stored  in  ASCII 
format  for  further  analysis.  Software,  written  in  BASIC,  controlled  data  acquisition, 
data  storage,  and  sequencing  of  the  receiver/scanner  to  the  next  programmed  channel. 

Heart  Rate  Monitors  and  Transmitters:  Heart  rate  simultaneously  was  measured  in  each 
goat  for  a  period  of  at  least  2  hr  using  a  battery-operated  electrocardiocorder®  cassette 
recorder  (Model  456B,  Del  Mar  Avionics,  Irvine,  CA)  and  a  heart  rate  FM  transmitter. 
The  electrocardiocorder®  contained  an  audio  tape  cassette  (Hotter  24-hr  cassettes,  Del 
Mar  Avionics,  Cincinnati,  OH)  on  which  the  heart  rate  was  recorded,  each  minute,  for 
subsequent  analysis  by  computer  (Hotter  Analysis  System,  IBM  PC  Model  152,  Del  Mar 
Avionics,  Irvine,  CA).  The  FM  radio  transmitter  (Figure  III.2)  was  similar  to  the  one 
used  to  measure  body  temperature  except  that  the  temperature  sensitive  oscillator  has 
been  replaced  with  a  high  gain  electrocardiograph  (ECG)  amplifier  requiring  a  4-volt, 
center-tapped  power  supply  (BT1,  2,  3).  A  remote-controlled  power  switch  also  was 
included  but,  because  of  the  center-tapped  power  supply,  only  the  transmitter  was 
turned  ’on’  and  "off”;  the  ECG  amplifier  section  was  left  "on"  continuously  with  a 
quiescent  current  drain  of  only  0.01  mA.  By  incorporating  a  MOSFET  (Varosi  filalt 
1989),  both  the  transmitter  and  amplifier  sections  could  be  switched  "on"  and  "ofT  but 
this  would  only  complicate  fabrication.  The  ECG  amplifier  was  constructed  with  a  CMOS 
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Figure  IIl.l. 

BASIC  COMPONENTS  OF  TELEMETRY  SYSTEM. 
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FM  TRANSMITTER 


Figure  III. 2.  ECG  TRANSMITTER 


op-amp  which  had  an  extremely  high  input  impedance  and  very  low  power  drain.  The 
first  stage,  U2A,  was  a  high  impedance  unity  gain  buffer  which  prevents  skin  and 
electrode  resistance  from  attenuating  the  ECG  signal  before  amplification.  The  output  of 
U2A  was  amplified  by  U2B  with  a  gain  of  440.  The  output  of  U2A  modulated  the  FM 
transmitter  with  the  ECG  profile  detected  at  the  ECG  input.  The  FM  transmitter  carrier 
frequency  was  tunable,  via  trimmer  capacitor  Cl ,  between  88-1 70  MHz. 

Lowpass  Filter:  The  scanner  audio  output  contained  high  frequency  noise  as  well  as  the 
desired  low  frequency  ECG  signal.  To  extract  the  ECG  signal,  a  three-stage  lowpass  filter 
(Figure  III.3)  using  OP-amps  UIA,  U2A,  and  USA  was  employed.  Each  identical  stage 
was  a  second-order,  unity  DC  gain,  active  lowpass  filter  with  a  -3dB  cutoff  at  48Hz. 
The  output  of  the  third  stage  was  amplified  by  U1B  with  adjustable  gain  of  1  to  21.  The 
output  of  U1B  was  the  filtered  ECG  signal.  At  this  point,  an  oscilloscope  was  used  to  view 
the  signal  to  ensure  that  the  highest  peak  of  the  signal  was  positive,  which  should  be  the 
case  provided  the  leads  are  properly  placed. 

One-Shot  Trigger:  The  ECG  signal  was  compared  by  U4B  to  a  level  preset  by  a  10k 
potentiometer  (Figure  1 1 1.3)  to  reduce  the  possibility  of  muscle  artifact  and  other  noise 
being  interpreted  as  an  ECG  signal.  The  ECG  signal  usually  was  much  larger  than  the 
other  signals  and  setting  the  comparator  level  to  75%  of  the  highest  positive  ECG  peak 
was  sufficient  to  reduce  false  triggers.  To  further  reduce  the  incidence  of  false  triggers, 
a  one-shot  trigger  circuit  U3A  and  U3B  was  introduced.  When  triggered,  the  output  U3B 
went  high  (+8V)  and  stayed  high  for  a  period  set  to  about  50-75%  of  the  ECG  signal 
period  (adjusted  by  a  1M  potentiometer)  during  which  the  circuit  was  immune  to 
further  triggering.  This  circuit  reduced  the  possibility  of  false  triggers  between  ECG 
pulses  from  noise  and  secondary  peaks  of  strong  ECG  signals.  The  one-shot  trigger 
output  was  sent  to  the  frequency  counter  which  counted  positive  rising  edges  of  the  pulse 
waveform  which  corresponded  to  ECG  pulses. 

External  mounting  of  heart  rate  monitors  and  radio  transmitters:  After  shaving  an 
approximate  13cm  X  13cm  area  on  each  side  of  a  goat's  thorax,  both  areas  were 
thoroughly  cleaned  with  70%  alcohol  saturated  using  4cm  X  4cm  gauze  pads.  Before 
recording  heart  rate,  flexible  elastic  bandaging  tape  (Flexo®,  Horse  Health  Products, 
Inc.,  Aiken,  SC)  was  wrapped  around  the  animal  to  protect  the  heart  rate  monitor.  FM 
transmitter,  electrodes,  and  electrode  leads  from  dislocation  and  from  the  other  goats.  At 
the  end  of  each  sampling  period,  the  monitor,  radio  transmitter,  and  electrodes  were 
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Figure  III. 3.  LOW  PASS  FILTER  AND  ONE  SHOT  TRIGGER 


carefully  removed  and  the  attachment  sites  cleaned  with  alcohol.  Because  the  electrode 
attachment  procedure  can  cause  the  skin  to  become  irritated,  a  period  of  at  least  3  days 
elapsed  between  heart  rate  measurements. 

1 .  Heart  Rate  Monitor:  Leads  from  the  electrocardiocorder®  were  attached  to 
silver/sHver  chloride  electrocardiogram  (ECG)  monitoring  electrodes  (WA580 
Foam  Electrodes,  MedTek,  St  Petersburg.  FL)  placed  over  the  scapular  and  heart 
apex  areas.  On  the  left  side,  the  first  electrode  was  placed  over  the  apex  of  the 
heart;  the  second  over  the  caudal  border  of  the  scapula;  and  the  third  mid-way 
between  the  apex  and  scapula.  On  the  right  side,  the  first  electrode  was  placed 
over  the  apex  and  the  second  over  the  caudal  border  of  the  scapula.  The 
electrocardiocorder®  was  connected  to  the  five  leads  and  tested  for  operation 
using  an  electrocardiograph  (Model  EK/5A,  Burdick  Corp.,  Milton,  Wl)  for 
stripchart  ECG  recordings.  The  clock  inside  the  heart  rate  recorder  was 
synchronized  with  the  clock  on  the  computer  (IBM  Model  P/S  2-50)  which 
accessed  the  radio  signals  from  the  radio  transmitters. 

2 .  Radio  transmitter:  Radio  transmitter  leads  were  kept  clipped  together  to  protect 
the  amplifier  section  from  overload  by  external  fields.  With  the 
receiver/scanner  tuned  to  the  appropriate  radio  frequency,  the  transmitter  was 
switched  to  the  "on"  position  with  the  electromagnetic  pulse  generator 
(electromagnetic  pulse  generator)  and  the  receiver/scanner  was  fine-tuned 
until  no  static  (i.e.,  "white  noise")  was  heard.  The  leads  then  were  undipped  and 
connected  to  the  ECG  monitoring  electrodes:  the  right  lead  over  the  right  apex  of 
the  heart  and  the  left  lead  over  the  caudal  border  of  the  scapula.  Because  the 
transmitter  is  "turned-off  when  the  leads  are  undipped,  it  must  be  "turned-on" 
with  the  pulser.  A  storage  oscilloscope  (Model  7623A,  Tektronix,  Beaverton. 
OR)  was  used  to  observe  ECG  waveforms.  After  all  transmitters  were  fitted  and 
checked,  the  telemetry  system  was  switched  to  sequentially  monitored  heart 
rates.  Heart  rate  for  each  animal  was  counted  for  lOsec  within  a  20sec  interval. 

Data  collection:  After  the  radio  transmitters  and  heart  rate  monitors  were  externally 
mounted,  the  animals  were  placed  into  their  respective  pens.  Although  heart  rate  data 
were  collected  for  a  minimum  of  2  hr,  analysis  involved  only  a  30  min  segment  which 
began  60  min  after  the  mounting  of  the  radio  transmitters  and  monitors.  The  60  min 
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delay  provided  a  sufficient  period  of  time  for  the  animals  to  adjust  physiologically  to  the 
instrumentation  and  to  obtain  stable  heart  rate  values. 

Statistical  analysis:  Heart  rate  data  virere  analyzed  using  a  commercially  available 
personal  computer  software  program  (Stat  View  512;  Winer,  1971), 

a  RESULTS  AND  DISCUSSION 

After  placement  of  the  heart  rate  measuring  devices,  the  animals  were  returned 
to  their  respective  pens.  Although  heart  rates  were  recorded  for  a  period  of  2  hr,  initial 
analyses  involved  both  30  min  and  60  min  mean  heart  rates  taken  from  each  animal 
following  a  60  min  period  of  readjustment  to  their  pens  and  heart  rate  m  asuring 
devices.  The  relationship  between  the  30  min  and  60  min  mean  heart  rates  obtained 
using  the  heart  rate  monitors  is  described  in  Figure  I II. 4.  Correlations  also  were  made 
between  the  30  min  and  60  min  mean  heart  rates  obtained  using  radio  transmitters 
(Figure  III.5).  These  relationships  suggest  that  either  the  30  min  or  60  min  mean 
heart  rate  values  could  be  used  in  subsequent  analyses. 

The  relationship  between  30  min  mean  heart  rate  values  obtained  from  radio 
transmitters  and  monitors  in  control  goats  is  described  by  the  equation: 

y  =  0.808X  +  23.934  (  1  ) 

v.'here:  y  =  30  min  mean  heart  rate  from  radio  transmitter,  and 

X  =  30  min  mean  heart  rate  from  monitor 

Further,  the  radio  transmitter  data  were  highly  correlated  (r  =  0.904;  r^  »  0.817; 
p<0.0001)  with  those  obtained  from  the  heart  rate  recorders.  Although  there  is  no 
consistent  trend,  the  percent  difference  between  these  two  devices  (Table  lll.l)  may  be 
partly  related  to  the  fact  that  the  radio  transmitter  signals  represent  time-averaged 
heart  rates  (i.e.,  over  a  30-sec  period)  whereas  the  heart  rate  monitor  records  beat- 
to-beat  heart  rates.  It  also  may  be  that  with  minor  modifications  of  the  transmitter 
circuitry,  the  percent  differences  could  be  reduced.  A  non-significant  (p  >  0.05) 
degradation  of  signal  strength,  proportional  to  operational  use  of  the  radio  transmitter, 
was  observed. 


23-14 


mean  heart  rates 


mean  heart  rates 


age) 


Table  III.1.  Comparison  of  heart  rates  (Mean  ±  SE)  obtained  from  four  goats  using  two 
heart  rate  monitors  and  six  radio  transmitters  for  each  animal. 


Percent 

Animal _ q} _ Radio  transmitters _ Heart  Rate  Monitors  Difference^ 


Goat  No.  53 

5 

126.2  ±.  2.2 

126.9  ±.  3.1 

+0.6 

Goat  No.  57 

8 

116.7  ±.  2.5 

1 12.9  ±  3.4 

-3.4 

Goat  No.  59 

6 

107.3  ±  1.2 

104.0  ±  3.2 

-3.2 

Goat  No.  60 

6 

119.6  ±  6.2 

122.4  ±  5.2 

+2.3 

Mean  of  above  groups 

117.0  ±  2.0 

111.8  ±  5.0 

-4.6 

^  Number  of  observations 

2  Heart  Rate  Monitor  Value  -  Transmitter  Value  x  100 
Heart  Rate  Monitor  Value 


Utilization  of  radio  transmitters  rather  than  heart  rate  monitors  to  measure 
heart  rates  of  homoiothermic  animats  provides  many  advantages.  For  example,  in 
comparing  costs  of  the  two  techniques,  transmitters  can  be  fabricated  in  the  laboratory 
at  a  unit  cost  of  about  $25  whereas  the  heart  rate  monitors  nave  a  unit  price  of  about 
$1500.  There  is  a  similar  cost  differential  for  necessary  support  equipment.  Thus, 
using  radio  transmitters,  heart  rates  from  a  large  number  of  animals  can  be  obtained 
virtually  simultaneously  at  a  relatively  low  unit  or  total  cost. 

D.  CX3NCLJLJSIONS 


The  usefulness  of  the  locally  designed  and  fabricated  radio  transmitter  as  a  non- 
invasive  technique  to  evaluate  heart  rates  of  domestic  goats  was  successfully 
demonstrated  through  comparison  with  data  recorded  from  a  commercially  available 
heart  rate  monitor.  The  descriptions  of  the  telemetry  system,  radio  transmitter, 
lowpass  filter,  and  one-shot  trigger  also  have  been  provided. 

IV.  OBJECTIVE  NO.  2;  HEART  RATE  AS  PREDICTOR  OF  PLASMA  CORTISOL  LEVELS 
A  INTRODUCTION 


When  animals  are  exposed  to  conditions  for  which  they  are  neither  accustomed 
nor  physiologically  adapted,  they  experience  a  stressor  response  which  is  characterized 
by  elevated  serum  or  plasma  glucocorticoids.  The  elevation  of  glucocorticoids  is  a 
protective  response  of  the  body  for  mobilizing  glucose  to  meet  the  physiological  demands 
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induced  by  a  stressor.  Thus,  plasma  corticosteroids  have  been  used  as  indicators  of 
stressor  response  (Selye,  1950). 

But,  their  reliability  as  stress  indicators  has  been  challenged  because  many 
factors  can  cause  an  elevation  of  these  substances.  For  example,  procedures  associated 
with  collecting  blood  samples  (e.g.,  venipuncture,  animal  handling)  have  been  shown  to 
cause  elevated  level  of  plasma  corticosteroids  (Bassett  and  Hinks,  1969).  Lymphopenia 
and  neutrophilia  also  have  been  reported  to  be  stressor  responses  (Dalton  and  Selye, 
1939)  but  these  values  also  have  been  reported  to  be  influenced  by  administration  of 
cortisol  in  the  horse  (Burguez  fiiai,  1983). 

So,  the  purpose  of  the  study  reported  here  was  to  determine  whether  heart  rate 
could  be  used  as  a  predictor  of  stressor  response  in  goats.  Availability  of  such  a 
predictor  would  enhance  collection  of  data  on  the  physiological  consequence  of  exposing 
animals  to  stressors.  Comparisons  were  made  between  heart  rates  and  plasma  cortisol 
levels  and  white  blood  cell  changes  in  goats  exposed  to  minimal  (i.e.,  <50  dB(A))  and 
those  exposed  to  elevated  noise  (i.e.,  105  dB(A))  levels.  Plasma  cortisol  was  measured 
because  it  is  considered  to  bo  the  predominant  corticosteroid  in  ovine  (Bassett  and 
Hinks,  1969)  and  caprine  (Lindner,  1964). 

B.  MATERIALS  AND  METHODS 

Animal  environment,  feeding  and  watering  schedules,  heart  rate  measurements, 
external  mounting  of  transmitters,  and  telemetry  system,  were  as  described  for  the 
Objective  entitled  VALIDATION  OF  A  HEART  RATE  RADIO  TRANSMITTER  (See  Section  III.B 
and  III.C  above).  A  total  of  12  mixed-breed  goats  (6  male  and  6  female)  was  used. 

Noise  Exposure:  The  goats  were  exposed  to  single  event  and  multiple  event  A- 

weighted  noise  levels  (dBA)  as  described  in  the  Noise  Simulation  System  (NSS)  User’s 
Manual  (Chavez  filal,  1990).  The  single  event  Equivalent  Sound  Level  (LEO)  for  the 
experimental  group  was  105.3  dBA  for  the  aircraft  flyover  signal  ID  No.4  (F4D).  The 
control  group  noise  levels  did  not  exceed  50  dBA.  All  noise  exposures  were  separated  by 
at  least  6  days.  Multiple  event  exposures  consisted  of  3  single  event  at  6  min  intervals 
or  13  single  events  at  1  min  intervals  over  a  total  time  period  of  12  min.  Following 
completion  of  the  above,  the  location  of  the  groups  of  animals  was  reversed  and,  after  an 
interval  of  10  da  for  conditioning  the  animals  to  their  new  pens  (McNatty  and  Young, 
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1973),  the  sequence  was  replicated.  Because  of  this  crossover,  there  was  a  total  of  12 
control  and  12  experimental  animals. 

Blood  Sampling:  Blood  samples  (~4cc)  from  the  jugular  vein  were  collected  in 

vacutainers  (Becton  Dickinson,  Reorder  No  6450)  containing  EDTA  as  an  anticoagulant. 
Samples  were  collected  at  about  the  same  time  intervals  each  day  of  collection  and  prior 
to  exposing  the  goats  to  a  noise  stressor  (Control  Samples)  and  at  ~20  min,  ~75  min, 
and  ~120  min  following  the  noise  stimulus  (Experimental  Samples).  Immediately 
following  collection,  the  vacutainers  were  placed  in  crushed  ice  until  hematologic 
measurements  were  completed,  usually  within  45-min  of  collection. 

Hematology:  Packed  cell  volume  (PCV)  was  determined  using  the  microcapillary 
hematocrit  tube  method  (Clay  Adams,  Autocrit  Centrifuge,  Model  No.  0551)  and  plasma 
protein  was  measured  using  a  hand-held  refractometer  (Schuco  No.  21300).  Blood 
smears  were  made  by  the  pull  slide  technique  and  stained  with  anthene,  thiazine,  azure 
A,  and  methylene  blue  dyes  (Harleco's  Diff  Quik  Stain).  Blood  samples  then  were  placed 
in  a  clinical  centrifuge  (Model  CL,  International  Equipment  Company)  and  the  plasma 
separated  from  the  fo'-med  elements.  Plasma  was  transferred  to  2  ml  capped 
polypropylene  vials  (Naige  Company,  Catalog  No.  5000-0020)  which  were  placed  in  a 
freezer  (-20°C)  until  plasma  cortisol  measurements  were  completed. 

Radioimmunoassay:  Radioimmunoassay  of  plasma  cortisol  was  completed  using  a 
commercially  available  kit  (Baxter  Travenol  Diagnostics,  Inc.,  Cortisol 
Radioimmunoassay  Kit). 

Statistical  analysis:  Data  were  analyzed  using  a  commercially  available  personal 
computer  software  program  (Stat  View  512;  Winer,  1971). 

C  RESULTS  AND  DISCUSSION 

Goat  plasma  cortisol  was  observed  to  be  0.90  ±.  0.13  ug/dl.  This  value  is 
qualitatively  similar  but  less  than  1.2  ug/IOOmI  reported  by  Lindner  fiiai(1964)  and 
is  about  10%  of  the  plasma  cortisol  found  in  humans  (Cohn  and  Kaplan,  1966). 

A  simple  regression  analysis  revealed  no  statistically  significant  correlation 
between  plasma  cortisol  and  heart  rates  obtained  from  radio  transmitters  (Figure  IV.1). 
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Figure  IV . 1 .  The  relationship  between  plasma  cortisol  and  heart  rate 


y  s  3.961E-4X  ♦  .488,  R-squared:  .002 
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This  finding  conflicts  with  a  previous  report  in  which  a  linear  relationship  between 
heart  rate  and  plasma  cortisol  was  suggested  for  bighorn  sheep  (Harlow  £iaL  1987). 
However,  in  the  latter  study,  the  relationship  was  observed  in  individual  vice  group 
mean  values  as  reported  herein.  Further,  in  the  Harlow  fiiaL  (1987)  study,  blood 
samples  were  obtained  via  indwelling  jugular  vein  cannulae.  Because  cannulae  were 
used,  blood  samples  could  be  obtained  rapidly  and  at  more  frequent  intervals  (i.e.,  5, 
19,  15,  20,  30,  48,  and  100  min  intervals  compared  to  only  three  intervals  (i.e.,  20, 
75,  and  120  min)  reported  herein. 

A  comparison  of  heart  rate  and  plasma  cortisol  changes  preceding  and  following 
exposure  of  goats  to  jet  aircraft  noise  is  displayed  in  Table  IV.1.  There  were  no 
detectable  increases  in  either  heart  rate  or  cortisol  for  any  of  the  post-noise  exposure 
periods  used  except  for  the  75  min  post-noise  heart  rate  which  is  thought  to  be 
unrelated  to  the  noise  stimulus.  Either  the  noise  stimuli  used  in  this  study  were  not  of 
sufficient  intensity  to  elicit  an  adrenal  response  or  the  response  was  of  such  short 
duration  that  it  was  undetectable  within  the  20  min  post-noise  plasma  sample. 

It  also  may  be  that  the  high  variability  in  plasma  cortisol  values  obscured  the 
possible  relationship  between  heart  rate  and  cortisol.  It  is  not  known  whether  that 
variability  was  due  to  the  inherent  characteristics  of  cortisol,  radioimmunoassay 
technique  relative  to  goat  plasma,  animal  species  ,  or  some  combination  of  those  factors. 

Table  IV.1.  Comparison  of  heart  rate  and  plasma  cortisol  changes  preceding  and  following 
exposure  of  goats  to  jet  aircraft  noise.  All  values  are  Mean  ±  SE. 

ConditLOD _ _ Heart  Rate _ Cortisol _ 


Pre-noise 

12 

125  ±  6 

0.71  ±  0.09 

20  min  Post-noise 

1  4 

137  ±8 

0.61  ±.  0.12 

75  min  Post-noise 

1  1 

146  i:  92 

0.70  i:  0.16 

120  min  Post-noise 

5 

119  _  14 

0.38  ±0.18 

^  Number  of  determinations  from  9  goats 
2  p  <  0.05  compared  to  pre-noise  condition 
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D. 


OONCXJUSIONS 


Because  of  the  advantages  of  using  cardiac  frequency  as  an  indicator  of  plasma 
cortisol,  further  studies  should  be  conducted  to  fulfill  the  intent  of  this  objective.  If 
additional  studies  are  conducted,  the  proto(x>ls  should  provide  for  repetitive  blood 
sampling  from  indwelling  cannulae  similar  to  the  procedure  employed  by  Harlow  £ial 
(1987). 

V.  OBJECTIVE  NO.  3:  BEHAVIORAL  RESPONSES  OF  GOATS  TO  SIMULATED 

JET  AIRCRAFT  NOISE 

A  INTRODUCTION 

There  have  been  many  reports  on  the  behavioral  responses  of  various  domestic 
animals  (Bond.  1971;  Ewbank,  1977)  and  wildlife  (Ames,  1978)  to  noise  and  sonic 
booms.  In  domestic  animals,  the  representative  behavioral  signs  include  startle  reflex, 
avoidance  behavior,  and  vocal  repetition  rates  (Espmark  aiai,  1974;  Ruth.  1976).  Ail 
species  studied  appear  to  readily  adapt  to  sonic  booms  and  display  behavioral  startle  but 
no  loss  of  production:  cattle  exhibit  brief  alterations  in  behavior,  sheep  temporarily 
stop  grazing,  ponies  react  to  a  greater  extent  than  cattle,  pigs  show  little  response,  and 
poultry  display  the  most  pronounced  response  of  all  observed  domestic  animals 
(Ewbank,  1977). 

Heart  rate  changes  and  avoidance  behavior  have  been  used  to  evaluate  goat 
encounters  with  people  (Lyons  and  Price,  1987).  Although  heart  rate  telemetry  is 
more  objective,  it  also  is  more  costly  and  behavioral  cues  have  been  found  to  be  useful  in 
assessing  an  animal's  response  to  human  disturbances.  No  responses  to  disturbances 
were  detected  using  heart  rate  telemetry  that  were  not  apparent  from  behavioral  cues 
alone  (MacArthur  filal  1982). 

The  purposes  of  the  study  reported  here  were  to  describe  the  behavioral 
responses  of  domestic  goats  to  single  and  multiple  jet  aircraft  noise  exposures  and  to 
determine  whether  the  behavioral  responses  could  be  detected  in  the  absence  of 
measurable  physiological  responses. 

B.  MATERIALS  AND  METHODS 
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Animal  environments,  feeding  and  watering  schedules,  heart  rate  measurements, 
external  mounting  of  transmitters,  and  telemetry  system,  all  were  as  described  for 
Objective  No.  1  entitled  VALIDATION  OF  A  HEART  RATE  RADIO  TRANSMITTER  (See  Section 
III.B  above).  Animals  and  noise  exposures  were  as  described  for  Objective  No.  2  entitled 
HEART  RATE  AS  A  PREDICTOR  OF  PLASMA  CORTISOL  (See  Section  IV.B  above). 


Behavioral  Responses:  Behavior  was  ecorded  on  black  and  white  videotapes  (Radio 
Shack,  Supe^ape  Videocassette,  T-120)  on  each  day  the  animals  were  exposed  to  a  noise 
stimulus.  The  recording  period  began  about  1  hr  prior  to  and  continued  for  at  least  2.5 
hr  following  the  noise  stimulus.  Behavioral  responses  included  three  defined  groups  (a) 
no  reaction,  (b)  moderate  reaction,  and  (c)  strong  reaction  (Table  V.1). 


Table  V.1.  Behavioral  responses  of  goats  (Espmark  alaL  1974). 


Eating 

No  reaction 
Moderate  reaction 


Strong  reaction 

Lying 

No  reaction 
Moderate  reaction 

Strong  reaction 

Standing 
No  reaction 
Moderate  reaction 
Strong  reaction 

Walking 

No  reaction 
Moderate  reaction 
Strong  reaction 


1 .  Startle,  no  interruption  of  eating 

2.  Interruption  of  eating,  raising  of  head, 
pricking  of  ears,  no  locomotion 

Interruption  of  browsing,  locomotion  of  up  to  10m 


1 .  intentions  to  get  up. 

2.  Quick  raising  of  head,  pricking  of  ears 
Momentary  rising  into  standing  position 


Startle,  transient  raising  of  head,  no  locomotion 
Startle,  transient  nodding  of  head,  locomotion 


Interruption  of  walking,  raising  of  head,looking  around 
Conversion  of  walking  into  running 


Video  Cameras:  In  each  animal  pen  two  video  cameras  (Koyo,  TVC  4000  series.  Closed 
Circuit  TV  Camera)  were  suspended  from  the  ceiling  so  that  each  camera  covered  one- 
half  of  the  pen  area.  Two  clocks  (Seth  Thomas,  Model  No.  0709)  also  were  suspended  in 
such  a  manner  that  each  camera  was  able  to  record  the  time  of  day  that  the  behavior  of 
the  animals  was  videotaped.  The  month/day  were  printed  in  5  in  high  block  numerals  on 
14  cm  X  21.5  cm  sheets  of  paper  which  were  attached  to  each  clock. 
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Videocassette  Recorders:  The  video  cameras  were  wired  to  TV  monitors  (Magnavox, 
Model  BX  3748)  and  video  cassette  recorders  (Quasar,  Model  VH  5290).  Thus,  behavior 
could  be  monitored  in  'real  time'  and  videotaped  for  viewing  at  a  later  time. 


a  RESULTS  AND  DISCUSSION 


Analysis  of  video  tape  recordings  of  goat  behavior  revealed  that  goats  from  both 
experimental  groups  displayed  behavioral  responses  to  single  and  multiple  event  noise 
exposures  (Table  V.2).  No  behavioral  responses  to  single  or  multiple  event  noise 
exposure  were  observed  in  either  control  group  of  animals. 

Table  V.2.  Comparison  of  responses  to  single  event  and  multiple  event  noise  stimuli. 
Thirteen  goats  were  included  in  each  experiment  group  except  for  multiple  3  in  Experiment  No. 

2  in  which  only  12  goats  were  used. 


_ Noise  Stimuli  Event _ 

Experiment  Type  #  of  Interval  Response  to  Stimuli  Number 

_ Group _ Events^  fdal^ _ EiKl _ SubseflUfifll^ 


No.  1 

Single 

1 

0 

Strong 

... 

Single 

1 

8 

Strong 

... 

Multiple 

3 

8 

Strong 

None 

Multiple 

13 

6 

Strong 

Moderate 

Single 

1 

10 

Moderate 

... 

No.  2 

Multiple 

13 

0 

Strong 

Moderate 

Single 

1 

10 

Strong 

... 

Multiple 

3 

14 

Strong 

Strong 

■  Number  of  events  in  a  12min  multiple  type  noise  exposure 

2  Interval  between  noise  stimuli  event;  0  interval  means  the  first  noise  exposure  for  "noise-naive* 
animals 

^  Refers  to  all  stimuli  following  the  initial  stimuli  in  a  multiple  event  exposure 

When  animals  were  exposed  to  a  single  event  noise  stimulus,  the  behavioral 
response  was  listed  as  "strong"  (Table  V.1)  for  all  animals  in  both  experiment  groups 
except  for  the  "moderate"  response  in  the  third  single  event  exposed  goats  in  Experiment 
1 .  It  also  should  be  noted  that  the  Multiple  3  animals  in  Experiment  No.  1  displayed  no 
response  to  the  second  and  third  noise  stimuli  yet  in  the  Multiple  3  goats  of  Experiment 
2  there  was  a  "strong"  response  to  the  second  and  third  stimuli.  These  differences  in 
response  are  thought  to  be  related  to  habituation  which  involves,  among  other  things, 
both  the  interval  between  noise  exposures  and  number  of  repetitive  exposures  to  the 
noise  stimuli. 
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An  analysis  of  heart  rate  data  (Table  V.3)  confirms  the  observed  behavioral 
changes.  Although  heart  rates  increased  in  both  control  and  experimental  groups,  the 
increases  were  55-93%  greater  in  the  latter  group. 

Table  V.3.  Heart  rate  changes  (Mean  ±  SE)  in  control  and  experimental  goats  in  the  5  min 
periods  immediately  prior  to  and  following  application  of  the  single  and  multiple  event  noise 
stim  jli.  Heart  rates  were  recorded  with  a  heart  rate  monitor  (See  text). 


Control _  %  Experimental  % 


Stimulus 

_ 

Pre-noise 

Post-noise 

Incr^ 

Pre-Noise 

Post-noise 

Incr^ 

Single  Event 

3 

107  ±  9 

114  ±7 

6.5 

129  ±  30 

142  ±  33 

10.1 

Multiple  Event 

4 

124  ±  5 

126  ±8 

1.6 

129  ±  13 

139  ±  13 

7.8 

^  Number  of  animals 
^  Post-noise  •  Pre-noise  x  100 

Pre-noise 

Because  the  single  and  multiple  r^vent  noise  exposures  occurred  at  different  time 
intervals  (Table  V.4),  an  effort  also  was  made  to  determine  the  relationship  between  the 
length  of  the  interval  and  the  net  percent  increase  in  heart  rate  in  pre-  and  post-noise 
exposed  control  and  experimental  goats.  A  regression  analysis  revealed  a  non¬ 
significant  indirect  correlation  (r  =  0.333;  r^  =  0.111;  p<0.52)  between  the  length  of 
noise  exposure  interval  and  the  net  percent  increase  in  heart  rate. 

Table  V.4.  Relationship  between  the  net  percent  increase  in  heart  rate  with  the  time  interval 
between  noise  exposures.  The  number  of  events  in  a  12  min  noise  exposure  and  the  interval 
descriptions  are  the  same  as  for  Figure  V.3  above. 


Date 

Tvpe  Event 

Interval  (dal 

Net 

%  Increase 

3  July 

Single 

0 

6.5 

11  July 

Multiple 

8 

1.2 

17  July 

Multiple 

6 

0.0 

27  July 

Single 

1  0 

3.3 

3  August 

Multiple 

0 

7.3 

13  August 

Single 

10 

- 

27  August 

Multiple 

14 

3.3 

D.  CXMXJUSIONS 


The  noise  stimuli  used  in  this  study  resulted  in  measurable  behavioral  responses 
in  goats.  The  degree  of  response  appears  to  be  related  to  habituation-both  length  of 
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interval  between  responses  and  the  number  of  repetitive  exposures.  Further,  the  heart 
rate  rata  revealed  no  responses  that  were  not  apparent  from  changes  in  behavior. 

VI.  OBJECTIVE  NO.  4:  PHYSIOLOGICAL  RESPONSES  OF  GOATS  TO  SIMULATED 

JET  AIRCRAFT  NOISE 

A.  INTRODUCTION 

Changes  in  adrenocortical  activity  (Selye,  1950),  body  temperature  (Stephens, 
1980;  Stermer  fiiaii  1980),  respiratory  frequency  (Stephens,  1980),  and  heart  rate 
(MacArthur  £lai.  1982)  all  have  been  used  to  measure  the  response  of  domestic  animals 
to  environmental  stressors.  Because  animal  handling-which  often  is  associated  with 
sample  collection-will  cause  an  increase  in  all  of  these  measurements,  remote 
monitoring  of  vital  signs  has  been  suggested  as  a  more  reliable  method  of  evaluating  the 
physiological  status  of  an  animal  (Stermer  £ial,  1980). 

Behavioral  adjustments  to  environmental  stressors  also  have  been  reported  to 
provide  information  on  the  physiological  well-being  of  animals  (Dantzer  and  Mormede, 
1983).  But,  behavioral  changes  do  not  always  give  a  reliable  indication  of  potential 
physiological  changes.  For  example,  Lyons  and  Price  (1987),  using  dam-reared  and 
human-reared  goats  observed  no  consistent  relationship  between  heart  rate  and 
behavior.  Moreover,  they  reported  that  significant  behavioral  changes  were  not 
accompanied  by  group  differences  in  heart  rate. 

So,  an  important  question  is  "What  parameters  can  be  used  to  evaluate  the  effects 
of  environmental  stressors?"  Utilizing  both  radiotelemetry  to  monitor  body  function 
(e.g.,  heart  rate  and  body  temperature)  and  other  standard  blood  tests  (e.g.,  plasma 
cortisol,  relative  numbers  of  lymphocytes  and  neutrophils),  the  responses  of  goats  to 
simulated  jet  aircraft  noise  were  evaluated  and  the  results  are  reported  here. 

B.  MATERIALS  AND  METHODS 

Animal  environments,  feeding  and  watering  schedules,  heart  rate  measurements, 
external  mounting  of  transmitters,  and  telemetry  system,  all  were  as  described  for 
Objective  No.  1  entitled  VALIDATION  OF  A  HEART  RATE  RADIO  TRANSMITTER  (See  Section 
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III.B  above).  Animals  and  noise  exposures  were  as  described  for  Objective  No.  2  entitled 
HEART  RATE  AS  A  PREDICTOR  OF  PLASMA  CORTISOL  (See  Section  IV.B  above). 


Statistical  analysis:  White  blood  cell,  heart  rate,  and  plasma  cortisol  changes  were 
analyzed  using  a  commercially  available  personal  computer  software  program  (Stat 
View  512;  Winer,  1971). 

C  RESULTS  AND  DISCUSSION 

There  were  no  detectable  changes  in  plasma  cortisol  for  any  of  the  goat  groups  or 
noise  stressors  (Table  VI. 1).  This  finding  is  similar  to  that  reported  by  Besch  (1989) 
but  unlike  the  one  reported  by  Ames  (1972)  for  lambs.  The  lack  of  response  in  goats 
exposed  to  the  jet  aircraft  engine  noise  stressor  may  have  been  partly  due  to  the  fact  that 
the  intensity/duration  of  the  stimulus  was  insufficient  to  elicit  a  response  or  that  the 
response  was  of  such  short  duration  (i.e.,  it  was  transient)  that  a  response  was 
undetectable  within  the  20  min  post-noise  plasma  sample.  It  also  is  possible  that  the 
goat  does  not  respond  in  any  detectable  way-other  than  behavioral-to  a  noise  stimulus 
from  jet  aircraft  engine  noise.  If  so.  this  indeed  would  be  a  unique  animal  model  for 
noise  studies. 

Table  VI. 1.  Plasma  cortisol  values  (ug/dl)  in  goats  prior  to  and  following  exposure  to  single 
and  multiple  noise  stimuli  (See  text  for  details).  All  values  are  Mean  ±  SE. 


Post-noise _ 

Group - □ - Stimulus _ Pre  noise _ ~2Q  min _ -75  min 

Control 


14 

Single 

0.83  ±  0.09 

0.85  ±.  0.08 

0.97  ±0.12 

7 

Multiple-3^ 

1.28  ±  0.34 

0.88  ±  0.33 

0.56  ±  0.25 

7 

Multiple-132 

0.78  ±  0.25 

0.42  ±  0.08 

0.68  ±0.15 

Experimental 

12 

Single 

0.78  ±  0.10 

0.82  i  0.09 

0.68  ±0.10 

6 

Multiple-3^ 

1 .46  ±  0.40 

1.00  ±  0.52 

0.55  ±  0.14 

6 

Multiple-132 

1.39  +  0.62 

1.01  ±  0.39 

0.67  ±0.18 

’  Three  single  event  stimuli  in  12  min;  ^  13  single  event  stimuli  in  12  min 
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Although  no  consistent,  significant  changes  in  relative  numbers  of  lymphocytes 
(Table  VI .2)  or  neutrophils  (Table  VI. 3)  were  detected,  there  was  a  tendency  toward 
lymphopenia  and  neutrophilia  in  the  experimental  group  of  goats. 

Table  VI.2.  Lymphocyte  (%)  changes  (Mean  ±  SE)  in  goats  exposed  to  single  and  multiple 
event  noise  exposures  (See  text). 


Control: 


Single  Event 

18 

60.2  ±  2.1 

18 

60.5  ±  2.2 

18 

60.1  ±  2.5 

Multiple  Event^ 

1  1 

64.4  ±.  1.9 

12 

62.5  ±  2.5 

13 

61.1  ±  3.3 

Multiple  Event^ 

Jain  (1986) 

13 

58.7  ±  3.0 
56.0 

12 

60.5  ±  2.3 

13 

57.8  ±  3.4 

Experimental: 


Single  Event 

18 

56.7  ±2.3 

18 

60.4 

± 

2.0 

18 

57.9  ±  2.4 

Multiple  Event^ 

12 

58.8  ±  3.7 

12 

57.9 

± 

4.8 

12 

56.7  ±  4.2 

Multiple  Event^ 

13 

56.4  ±  3.2 

13 

54.2 

± 

3.4 

13 

57.9  ±  2.7 

^  Number  of  animals 

2  Three  single  event  stimuli  in  12  min 

3  Thirteen  single  event  stimuli  in  12  min 


Table  VI. 3.  Neutrophil  (%)  changes  (Mean  ±  SE)  in  goats  exposed  to  single  and  multiple 
event  noise  exposures  (See  text). 


Control: 


Single  Event 

1  7 

35.8  ±  2.0 

17 

34.0  ±  2.3 

1  7 

37.1  ±  2.7 

Multiple  Event^ 

12 

32.5  ±  3.9 

12 

36.8  ±  2.3 

12 

38.5  ±  3.4 

Multiple  Event^ 

Jain  (1986) 

12 

33.2  ±  3.3 
36.0 

12 

31.4  ±  3.1 

12 

36.8  ±  3.2 

Experimental: 


Single  Event 

18 

36.6  ±  3.6 

17 

37.2  ±  2.0 

.  7 

40.9  ±  2.3 

Multiple  Event^ 

12 

36.2  ±  3.7 

12 

40.0  i  2.9 

12 

39.4  ±  2.7 

Multiple  Event^ 

12 

37.0  ±  3.1 

12 

38.2  ±  4.5 

12 

38.9  ±  3.8 

^  Number  of  animals 
^  Three  single  event  stimuli  in  1 2  min 
3  Thirteen  single  event  stimuli  in  12  min 
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Lymphopenia  and  neutrophilia  have  been  reported  to  be  stressor  responses 
(Dalton  and  Selye,  1939)  and  the  findings  reported  here  suggest  that  the  noise  stimuli 
were  not  of  sufficie  n  intensity/duration  to  cause  a  statistically  significant  response. 
Whether  these  findings  can  be  extrapolated  to  other  species  of  domestic  animals  is 
uncertain.  Nonetheless,  there  is  evidence  to  assume  that  occasional  flyovers  of  jet 
aircraft  should  not  ^ose  a  particularly  "stressful"  environment  for  domestic  goats. 

No  statistically  significant  differences  were  detected  in  packed  cell  volumes, 
plasma  protein,  or  growth  rate  between  control  and  noised  exposed  groups. 

D.  CONCLUSIONS 

Because  no  hematological  or  humoral  responses  were  detected,  neither  the  noise 
intensities  nor  durations  used  in  this  study  appear  to  be  physiologically  "stressful"  to 
goats.  Further,  if  the  physiological  responses  are  of  such  short  duration  that  they  are 
undetectable  20  min  after  exposure  to  a  noise  stressor,  then  behavioral  changes,  if  any, 
would  appear  to  be  good  indicators  of  an  animal's  response  to  acute  stress. 

VII.  RECOMMENDATIONS 

A.  The  lack  of  detectable  responses  to  jet  aircraft  noise  in  goats  appears  to  be 
related  to  the  intensity  or  duration  of  the  noise  stressor.  Further  studies  are  indicated 
to  determine  the  noise  threshold  that  will  elicit  detectable/measurable  responses  of 
goats 

3.  If  additional  studies  are  conducted,  radioteiemetry  should  be  used  to  obtain  heart 
rates  and  indwelling  venous  cannulae  should  be  used  in  the  collection  blood  samples  from 
domestic  animals  exposed  to  noise  stressors. 

C.  Because  of  the  advantages  of  using  cardiac  frequency  as  an  indicator  of  plasma 
cortisol  during  exposure  to  environmental  stressors,  additional  research  should  be 
conducted  to  determine  if  this  relaiionship  can  be  established  for  goats. 

D.  The  behavioral  responses  of  goats  to  simulated  jet  aircraft  engine  noise  suggests 
that  either  habituation  or  repeated  exposures  modifies  the  animals'  responses  to 
subsequent  noise  stressors,  t/hat  is  unknown  is  the  temporal  relationship  between 
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exposures  to  a  noise  stressor(s)  and  elicitation  of  a  detectable  response.  In  any  event, 
additional  study  in  this  area  also  should  be  encouraged. 

E  Data  collected  during  this  study  should  be  published  in  the  open  scientific 
literature.  In  the  meantime,  a  copy  of  this  report  should  be  shared  with  the  Noise  and 
Sonic  Boom  Impact  Technology  (NSBIT)  program  office  at  Wright- Patterson  AFB,  Ohio. 

F.  Because  animal  handling  can  affect  the  measured  physiological  parameters, 
studies  using  non-invasive  procedures  should  be  encouraged. 
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ABSTRACT 

The  Air  Force  has  identified  spills  of  jet  fuel  as  a  major  source  of  groundwater 
contamination  on  bases  across  the  nation.  To  investigate  the  separate  phase  movement  of 
organic  liquids  such  as  jet  fuel,  numerical  models  are  often  used.  Unfortunately,  there  are 
few  data  available  for  key  relationships  in  these  models  for  many  organic  liquids. 
Consequently,  estimation  techniques  are  often  employed  to  determine  the  relationships  of 
capillaiy  pressure  and  relative  permeability.  Previous  research  has  denaonstrated  that 
estimates  of  these  relationships  generated  with  common  techniques  sometimes  do  not 
correlate  well  with  measurements.  Thus,  the  use  of  these  techniques  may  introduce  error 
into  simulations  of  jet  fuel  migration  in  the  subsurface.  The  purpose  of  this  research  was  to 
assess  the  magnitude  of  this  error.  Simulations  of  organic  liquid  flow  were  carried  out 
using  measured  and  estimated  capillary  pressures  and  relative  permeabilities.  A 
comparison  of  the  simulations  shows  that  the  advance  of  the  organic  liquid  front  and  the 
total  mass  of  organic  liquid  infiltrated  is  significantly  overpredicted  when  estimated 
propenies  are  used.  Thus,  caution  must  be  exercised  in  basing  estimates  of  quantity  of  jet 
fuel  spilled  and  location  of  the  spill  boundaries  on  simulations  using  estimates  of  the 
transport  properties,  capillary  pressure  and  relative  permeability. 

The  results  of  this  research  have  been  presented  at  the  Symposium  on  Characterization  of 
Transport  Phenomena  in  the  Vadose  Zone,  Tucson,  AZ,  April  2-5,  1991  (sponsored  by 
American  Geophysical  Union  and  Soil  Science  Society  of  America),  and  will  be  submitted 
shortly  for  publication  in  Journal  of  Contaminant  Hydrology. 
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I.  INTRODUCTION 


The  Air  Force  has  identified  spills  of  jet  fuels,  such  as  JP-4,  as  a  major  source  of 
groundwater  contamination  on  bases  across  the  nation.  Owing  to  the  low  aqueous 
solubility  of  jet  fuel  in  water,  jet  fuel  persists  as  a  separate  liquid  phase  and  is  transported 
as  such  in  groundwater.  Despite  the  recognition  that  multiphase  flow  processes  are 
important,  many  questions  about  the  physics  of  such  processes  remain  unanswered. 

To  investigate  the  subsurface  movement  of  organic  liquids  such  as  JP-4  from  spill  sites, 
often  numerical  models  are  used.  These  numerical  models  are  based  on  equations  which 
require  the  use  of  constitutive  relationships  for  solution  (Abriola  and  Finder,  1985;  Faust, 
1985;  Osborne  and  Sykes,  1986;  Corapcioglu  and  Baehr,  1987;  Kuppusamy  et  al.,  1987). 
Unfortunately,  little  data  exist  for  several  of  these  constitutive  relationships  for  the  organic 
liquids  of  interest,  making  the  accurate  simulation  of  their  migration  difficult  (Schwille, 
1984;  Faust,  1985;  Finder  and  Abriola,  1986).  Two  relationships  of  particular  importance 
are  capillary  pressure  and  relative  permeability  as  functions  of  saturation.  Because  of  the 
lack  of  data,  many  numerical  simulations  employ  relationships  that  have  been  estimated 
using  a  variety  of  techniques  developed  in  soil  science  and  petroleum  engineering. 
However,  as  Demond  and  Roberts  (1991a,  1991b)  pointed  out,  the  estimates  produced  by 
these  techniques  sometimes  do  not  correlate  well  with  measurements.  Consequently,  use 
of  these  techniques  may  introduce  error  into  simulations  of  JP-4  migration  in  the 
subsurface. 


II.  OBJECTIVE 

The  purpose  of  this  research  was  to  evaluate  the  significance  of  the  error  introduced  into 
simulations  of  two-phase  flow  through  the  use  of  estimated  transport  relationships  rather 
than  measured.  To  accomplish  this  purpose,  two  series  of  numerical  simulations  were 
carried  out;  one  series  using  common  estimation  techniques,  and  the  other  using  published 
capillary  pressure  and  relative  permeability  data.  Then,  the  two  series  of  simulations  were 
compared  in  terms  of  location  of  organic  liquid  displacement  front  and  total  mass  of  organic 
liquid  infiltrated  and  the  accuracy  of  the  estimation  techniques  critiqued  based  on  the 
comparison. 


For  a  system  composed  of  a  rigid,  non-deformable,  homogeneous  porous  medium,  and 
two  mutually  saturated,  incompressible  liquid  phases,  with  no  internal  sources  or  sinks,  the 
fundamental  mass  balance  equations  foi  two-phase  flow  may  be  written: 


^  .  [22!^  (VP.  -  p.gVh)l  =  [1] 

V.[e!^(VP„  -p„gVh)l  =  ?L"P"g!!)  [2] 

where  p  =  density, 

k  =  intrinsic  permeability, 

kr  =  relative  permeability, 

^  =  viscosity, 

P  =  pressure, 

g  =  gravitational  acceleration  constant, 

h  =  elevation  above  a  datum, 

n  =  porosity, 

S  =  saturation, 
t  =  time, 

subscripts 

n  =  nonwetting  (here,  the  organic  liquid),  and 
w  =  wetting  (here,  the  aqueous  phase). 

To  obtain  a  solution,  the  constitutive  relationships  of  capillary  pressure: 

Pn-Pw  =  Pc  =  f(S)  [3] 

where  Pc  =  capillary  pressure 

and  relative  permeability: 

kri  =  ki/k  =  f(S)  [4] 
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where  kj  =  effective  permeability  to  ith  phase, 

must  be  provided  in  order  to  form  a  close  system  of  equations. 

These  relationships  may  be  measured  in  the  laboratory.  But,  because  their  measurement 
can  be  time-consuming  or  impractical,  they  are  often  estimated.  Currentiy,  the  method  of 
choice  for  estimating  the  primary  drainage  capillary  pressure-saiuratiofi  relationship  for 
organic  liquid- water  systems  follows  from  Leverett’s  function  (Leverett,  1941)  (Lenhard 
and  Paricer,  1987;  Parker  et  al.,  1987;  Cary  et  al.,  1989): 

J(S^)  =  Pci/YoRG1/H2o('^i/‘*i)*^  ~  ^c2^0RG2yH2o(^/"2)^^ 

where  J(Sw)  =  Leverett’s  function, 

Yorg/h20  “  interfacial  tension  between  organic  liquid  and 

aqueous  phases. 

In  situations  where  the  systems  have  the  same  intrinsic  permeability  and  porosity  (ki=k2; 
ni=n2),  then  Leverett's  function  simplifies  to: 


J(Sw)  =  Pcl/YoRGlAt20  Pc2^0RG2/H20 
Pc2  =  Pci  (Yorg2/H2c/YoRG1/H20^ 

Hence,  the  capillary  pressure  at  a  given  saturation  for  an  organic  liquid-water  system,  can 
be  generated  from,  for  example,  the  air-water  capillary  pressure-saturation  relationship  for 
that  soil  by  knowing  the  ratio  of  the  liquid-liquid  inteifacial  tensions.  This  method  is 
usually  applied  to  drainage,  but  it  seems  that  it  can  be  readily  applied  to  imbibition  if  the 
relevant  data  are  available  (Demond  and  Roberts,  1991a).  Eqn.  [6]  implies  that  the  value 
for  residual  saturation  of  the  wetting  phase  is  the  same  for  all  systems;  simply  the  capillary 
pressure  at  which  that  value  is  obtained  varies. 

Methods  to  estimate  relative  permeability  can  be  divided  into  two  basic  groups:  those  based 
on  capillary  pressure  measurements  and  those  whose  functional  form  is  a  power  function 
of  effective  saturation,  where  effective  saturation  is  defined  as: 
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[7] 


Q  (Sw  ~  Syvr) 

(1-Swr) 

where  Swr  =  residual  saturation  of  the  wetting  phase 

Techniques  commonly  used  fall  in  both  groups:  Burdine's  (1953)  and  Mualem's  (1976), 
from  the  former,  and  Corey's  (1954)  and  Wyllie's  (1962)  from  the  latter  (Table  1).  Both 
types  of  techniques  have  drawbacks.  The  power  function  models  give  the  same  results  for 
all  organic  liquid-water  pairs.  Using  the  models  based  on  capillary  pressure  measurements 
assumes  that  such  data  are  available.  If  these  data  are  estimated  using  Eqn  [6],  then  again 
the  same  results  are  obtained  for  all  organic  liquid-water  pairs.  These  methods  were 
developed  for  drainage  processes.  They  may  be  applied  to  the  imbibition  of  the  wetting 
phase  in  strongly- wetted  systems,  since  that  phase  does  not  exhibit  hysteresis.  However, 
the  nonwetting  phase  does  display  hysteresis.  Yet,  comparable  methods  for  the  prediction 
of  km  during  imbibition  are  uncommon. 

Demond  and  Roberts  (1991a,  1991b)  recently  compared  estimated  and  measured  capillary 
pressure  and  relative  permeability  relationships  for  organic  liquid-water  systems.  Their 
study  showed  that  as  the  interfacial  forces  decreased,  the  deviations  between  measured 
capillary  pressure  relationships  and  those  estimated  using  Eqn.  [6]  increased  (Fig.  1).  The 
imbibition  capillary  pressure  relationship  showed  a  stronger  effect  than  did  the  drainage 
relationship.  The  relative  permeability  of  the  wetting  phase  in  strongly-wetted  systems  is 
reasonably  well  estimated  by  the  models  listed  in  Table  1  (Fig.  2).  On  the  other  hand,  the 
relative  permeability  to  the  nonwetting  phase  is  grossly  overestimated  particularly  at  low 
aqueous  phase  saturations  (Fig.  3).  In  addition,  the  accuracy  of  the  estimates  for  both 
phases  grows  poorer  with  decreasing  interfacial  forces. 

Despite  the  observation  of  these  differences  between  measured  and  estimated  transport 
relationships,  no  quantitative  discussion  of  the  implications  for  the  tnodeling  of  organic 
liquid  migration  was  given.  Notwithstanding  the  magnitude  of  these  differences,  they  are 
only  important  in  so  far  as  they  affect  estimates  of  organic  liquid  movement  in  the 
subsurface.  This  research  addresses  that  issue.  To  assess  the  effect  of  estimated  transport 
relationships  on  predictions  of  two-phase  flow,  the  migration  of  an  organic  liquid  was 
simulated  using  both  measured  and  estimated  data.  Three  series  of  simulations  were 
performed:  using  all  measured  data,  using  measured  capillary  pressure  and  estimated 
relative  permeability  relationships,  and  using  estimates  for  both  relationships. 
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Figure  3.  Comparison  of  measured  and  estimated  relative  permeability-saturation  relationships  for  the  organic  liquid 


IV.  METHODOLOGY 


To  investigate  the  effect  of  estimated  transport  relationships  on  predictions  of  multiphase 
flow,  a  simple,  one-dimensional  scenario  is  hypothesized.  An  aquifer  is  confined  and  at 
atmospheric  pressure.  It  has  no  atural  pressure  gradient  and  is  initially  saturated  with 
water.  An  underground  storage  tank  filled  with  organic  liquid  located  at  x=0  springs  a  leak 
at  time  t=0  (Fig.  4).  The  head  in  the  tank  is  sufficient  to  displace  all  the  water  at  the 
upstream  boundary.  The  tank  has  a  sufficiently  large  circumference  so  that  leakage  into  the 
aquifer  does  not  significantly  affect  the  height  of  liquid  in  the  tank.  The  downstream 
boundary,  x=L,  is  located  at  a  sufficient  distance  so  that  the  front  of  the  organic  liquid  does 
not  reach  the  boundary.  The  appropriate  equations,  initial  and  boundary  conditions  for 
such  a  situation  are: 


0  ^Pwkkfw  /0Pw\i  ^(npwSw) 


[8] 


a  rPnitkm  /aPnM  a(npnSn) 

— 3i — 


[91 


at  t  =  0, 

Pw  =  0 

o 

II 

e 

a, 

for  all  x; 

att>0 

Pw  =  0 

Pn  =  50  cm 

atx  =  0 

Pw  =  0 

atx  =  L 

Three  organic  liquid-water  pairs  were  employed  in  the  simulations.  Their  properties  are 
summarized  in  Table  2.  n-Dodecane  and  o-xylene  are  components  of  JP-4  (Smith  et  al., 
1981)  and  clean  JP-4  has  a  similar  density,  viscosity,  and  liquid-liquid  interfacial  tension  to 
n-dodecane  tSittig,  1985).  The  porous  medium  was  similar  to  the  sand  at  Tyndall  AFB. 
The  similarity  in  properties  is  shown  in  Figure  5  (Demond,  1989;  Demond  and  Roberts, 
1991a).  It  was  a  clean,  fine  to  medium  sand  from  a  contaminated  aquifer  located  in 
Borden,  Ontario  (Mackay  et  al.,  1986).  The  porosity  and  intrinsic  permeability  of  the  sand 
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is  given  in  Table  3.  More  details  about  the  liquids  and  the  sand  can  be  found  in  Demond 
and  Roberts  (1991a  and  1991b). 


Figure  4.  Scenario  for  simulations  of  organic  liquid  migration. 


X  =  0  X  =  L 


TABLE  2.  Properties  of  Liquids  Used  in  Simulations 


Liquid 

p  (g/cm^) 

p(cp) 

YORG/H2O 

(dynes/cm) 

Contact  angle  on 
glass*  (°) 

n-dodecane 

0.7487 

1.508 

52.8 

17 

[25°C] 

[2] 

o-xylene 

0.8801 

0.810 

36.1 

[3] 

25 

n-butyl  acetate 

0.8764 

0.7375 

14.5 

59 

(25°C] 

[25'’C] 

[1] 

water 

0.9882 

1.002 

NA 

NA 

*The  second  liquid  is  water.  The  angle  is  reported  as  measured  through  the  aqueous 

phase.  All  contact  angle  measurements  are  from  Demond  (1988).  All  values  are  at  20°C 
unless  otherwise  indicated.  All  other  data  are  from  Riddick  et  al.  (1986)  unless  indicated 
by  [number]:  [1]  Donahue  and  Bartell  (1952);  [2]  Johnson  and  Dettre  (1966);  [3]  Young 
and  Harkins  (1928). 


Figure  5.  Comparison  of  properties  of  Borden  and  Tyndall  sands. 


TABLE  3.  Properties  of  Borden  Sand 


porosity 

(-) 

intrinsic 

permeability 

(cm2) 

0.33 

8.4x10-8 

The  measurements  of  capillary  pressure  and  relative  permeability  were  obtained  from 
Denaond  and  Roberts  (1991a)  and  (1991b)  respectively.  The  capillary  pressure  data  were 
then  fit  with  van  Genuchten's  function  (van  Genuchten,  1980): 


Se  = 


_ 1 

LI  +  (c 


nl-i/n 


(aPc)"J 


[10] 


where  n  and  a  =  fitting  constants. 


using  the  statistics  package  SYSTAT  (SYSTAT,  Inc.  Evanston,  II)  to  determine  n  and  a 
through  nonlinear  regression. 

Several  functional  forms  for  the  relative  permeability  data  were  tried.  The  rrxjst  successful 
were  the  following  equations,  derived  from  the  equations  given  in  Table  1 : 


krw  =  Sea[l-(l-Se>/b)b] 

[11] 

km  =  c(l-Se)‘l(l-Se»/e)e 

[12] 

where  a,  b,  c,  d,  and  e  =  fitting  constants. 

The  constants,  a,  b,  c,  d,  and  e  were  also  determined  using  SYSTAT. 

For  the  two-phase  flow  simulations,  the  original  intention  was  to  use  SWANFLOW 
(GeoTrans,  1989),  a  commercially  available  finite  difference  code.  Two  versions  of  this 
code,  the  original.  Version  1.0,  and  the  most  recent.  Version  3.0,  were  purchased. 
Unfortunately,  both  contained  problems  and  the  documentation  accompanying  the  codes 
was  sufficiently  incomplete  that  it  was  difficult  to  resolve  the  problems.  Eventually,  this 
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ccxie  was  abandoned  in  favor  of  a  code  developed  by  Reeves  and  Abiiola  (Reeves  and 
Abriola,  1988;  Reeves,  1991).  The  selected  code  uses  finite  element  approximation  for  the 
spatial  derivatives.  Linear  basis  funcdons  are  employed  and  weighting  is  based  on  the 
streamline-upwind  Petrov-Galerkin  technique.  The  time  derivatives  are  approximated 
using  finite  differences  and  the  resulting  matrices  are  solved  using  successive  substitution 
or  full  Newton- Raphson  should  initial  convergence  prove  unsatisfactory.  The  code  was 
executed  on  the  Apollo  network  of  workstations  operated  by  CAEN  (Computer  Aided 
Engineering  Network)  at  the  University  of  Michigan.  The  nodal  spacing  employed  in  the 
simulations  was  0.75  cm  and  the  length  of  the  domain  was  45  cm.  Due  to  computational 
constraints,  the  time  span  of  the  simulations  was  limited  to  1000  secs. 

Three  series  of  simulations  were  performed: 

1)  using  measured  data  for  both  capillary  pressure  and  relative  permeability, 
parameterized  using  Eqn.  [10],  and  Eqns.  [11]  and  [12],  respectively, 

2)  using  measured  capillary  pressure  relationships,  but  estimated  relative 
permeabilities  based  on  the  correlations  in  Table  1,  and 

3)  using  estimates  for  both  relationships,  with  capillary  pressure  relationships  for 
organic  liquid-water  pairs  based  on  air-water  data  (parameterized  using  Eqn.  [10]) 
and  Eqn  [6],  and  with  relative  permeability  relationships  given  by  Mualcm's 
correlation  (Table  1).  This  combination  was  selected  because  it  appears  to  be  the 
most  popular  in  simulations  of  subsurface  organic  liquid  flow. 


V.  RESULTS  AND  DISCUSSION 

The  ability  of  van  Genuchten's  function  (Eqn.  10)  to  fit  the  capillary  pressure  data  is 
shown  in  Figure  6.  The  ability  of  Eqns.  [11]  and  [12]  to  fit  the  relative  permeability  data  is 
shown  in  Figures  2  and  3.  These  figures  demonstrate  that  these  functional  forms  are 
suitable  for  the  parameterization  of  organic  liquid-water  transpon  relationships.  The 
parameters  for  capillary  pressure  and  relative  permeability  relationships  are  given  in  Tables 
4  and  5,  respectively. 

Figure  7  shows  a  sample  simulation  using  measured  data,  illustrating  the  progression  of  the 
organic  liquid  front  over  time.  The  fronts  at  two  times,  495  seconds  and  1000  seconds,  are 
shown  again  in  Figures  8  and  9,  respectively;  but  here,  these  fronts  are  compared  to  those 
generated  using  estimated  relationships.  The  comparison  shown  in  Figures  8  and  9  reveals 
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a  significant  discrepancy  between  the  predicted  location  of  the  front  using  measured 
relationships  and  those  using  estimated  relationships.  Most  of  this  discrepancy  probably 
stems  from  the  poor  approximation  of  the  relative  permeability  to  the  organic  liquid 
(Figures  1-3).  The  discrepancy  grows  if  both  the  capillary  pressure  and  relative 
permeability  relationships  are  predicted.  The  higher  saturations  at  x=0  produced  in  the 
simulations  for  the  combination  of  estimated  capillary  pressure  and  estimated  relative 
permeability  (Series  3)  result  from  the  fact  that  Leverett’s  function  predicts  a  lower  residual 
saturation  than  measured  (Figure  1).  Comparing  Figures  8  and  9  shows  that  the 
discrepancy  increases  with  time.  Based  on  this  observation  at  short  times,  one  would 
expect  even  larger  differences  at  time  scales  typical  of  field  problems. 

Table  6  lists  estimates  of  the  total  volume  of  organic  liquid  infiltrated  per  unit  area, 
corresponding  to  the  displacement  fronts  shown  in  Figures  8  and  9.  The  error  in  the 
estimate  of  the  amount  of  organic  liquid  infiltrated  is  similar  for  all  four  methods  for 
calculating  relative  permeability,  ranging  from  about  10  to  12%  at  495  seconds.  If  the 
capillary  pressure  is  also  estimated,  the  error  increases  to  about  25%.  At  longer  times,  the 
error  may  be  greater.  At  1000  seconds,  the  error  resulting  from  estimation  of  both  capillary 
pressure  and  relative  permeability  is  on  the  order  of  30%. 

Tiie  simulations  shown  here  are  all  of  the  drainage  process.  Since  previous  work  shows 
that  the  discrepancies  between  measured  and  estimated  imbibition  relationships  are  even 
greater,  one  would  expect  the  errors  to  be  larger  in  simulations  of  the  imbibition  process. 


VI.  CONCLUSIONS 

The  results  presented  here  suggest  that  if  the  capillary  pressure  and  relative  permeability  are 
estimated  using  techniques  commonly  employed,  the  location  of  the  displacement  front  and 
the  volume  of  organic  liquid  infiltrated  may  be  grossly  overpredicted.  In  the  field,  these 
errors  may  translate  into  an  inability  to  locate  the  boundaries  of  a  jet  fuel  spill  and  an 
overestimation  of  the  amount  of  fuel  that  is  present  in  the  subsurface.  The  results  point  to 
the  need  to  use  measured  relationships  rather  than  estimated  in  simulations  of  two-phase 
flow.  Yet,  it  is  impractical  to  measure  these  relationships  in  many  circumstances. 
Consequently,  attention  must  be  devoted  to  the  development  of  more  accurate  estimation 
techniques  for  capillary  pressure  and  relative  permeability  for  organic  liquids  in  the 
subsurface. 
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Figure  8.  Displacement  fronts  at  495  seconds  simulated  using  measured  and  estimated  transport  relationship; 


spLdcement  fronts  at  1000  seconds  simulated  using  measured  and  estimated  transport  relationsiiips 
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TABLE  5.  Parameters  for  Measured  Relative  Permeability-Saturation  Relationships 


TABLE  6.  Comparison  of  Volumes  infiltrated  Per  Unit  Area  of  Aquifer  Using  Measured  and  Estimated  Transport  Relationships 
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Units  are  cmVcm 


VII.  RECOMMENDATIONS 

The  research  presented  in  this  report  shows  that  the  location  of  the  organic  liquid 
displacement  front  and  the  amount  of  organic  liquid  infiltrated  may  be  significantly 
overpredicted  if  estimated,  rather  than  measured,  transport  relationships  are  used  in  the 
transport  simulations.  This  error  may  translate  into  large  overestimates  of  quantities  of  jet 
fuel  located  in  the  subsurface  at  a  given  spill  site,  or  an  inability  to  locate  the  boundaries  of 
a  subsurface  spill.  This  conclusion  results  in  two  recommendations: 

1)  Caution  must  be  exercised  in  basing  assessment  and  clean-up  strategies  on 
transport  simulations  which  employ  estimated  two-phase  flow  transport 
relationships.  An  analysis  which  examines  the  consequences  of  estimates  being 
significantly  in  error  should  be  part  of  any  assessment  program. 

2)  To  make  simulations  of  two-phase  flow  more  useful  in  subsurface  remediation, 
research  must  be  devoted  towards  developing  more  accurate  methods  for  the 
estimation  of  capillary  pressure  and  relative  permeability  relationships.  The  research 
should  address  not  only  the  reasons  why  the  common  methods  for  drainage  fail  to 
do  an  adequate  job,  but  also  the  lack  of  accepted  means  for  estimating  imbibition 
relationships. 
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ABSTRACT 

Enhancing  the  sorptive  capacity  of  natural  aquifer  materials  is  a 
relatively  new  approach  to  bringing  about  aquifer  remediation  and  protec¬ 
tion.  Previous  investigations  had  focused  on  the  application  of  cationi*. 
surfactants  to  enhance  sorption;  however,  the  utility  of  any  cationic 
surfactant  is  constrained  by  the  cation  exchange  capacity  of  the  target 
aquifer  and  the  biocidal  characteristics  of  the  surfactant.  An  alter¬ 
native  sorbent  could  be  an  innocuous  form  of  a  sorptive  nonaqueous  phase 
liquid  (SNAPL).  The  first  half  of  this  report  presents  evidence  of  signi¬ 
ficant  enhanced  sorption  in  soil  columns  possessing  a  nontoxic  SNAPL 
residual.  Breakthrough  curves  (BTCs)  of  pentafluorobenzoic  acid  (PFBA), 
benzene,  xylene,  PCE,  toluene,  and  1 -methyl naphthalene  were  measured 
during  aqueous  elution  through  soil  columns  totally  saturated  with  water 
and  through  soil  columns  containing  a  residual  saturation  of  decane. 
Three  conceptual  models  were  used  to  simulate  premature  breakthrough, 
asymmetry,  and  tailing  in  the  BTCs  caused  by  nonequilibrium  sorption  and 
rate  limited  mass  transfer  between  decane  and  aqueous  phases.  The  second 
half  of  the  report  is  devoted  to  the  development  and  application  of  a  new 
concept,  the  'effective  retardation'  which  can  be  used  in  subsurface 
sorption  system  design  and  in  the  prediction  of  system  performance. 
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1.0  INTRODUCTION 

Groundwater  contamination  resulting  from  fuels  and  oils  is  particu¬ 
larly  nefarious  because  of  the  widespread  use  of  petroleum  based  products 
and  the  health  risks  associated  with  the  hydrophobic  organic  components. 
Natural  geologic  materials  generally  have  a  low  retentive  capacity  for 
neutral  organic  compounds  (Schwarzenbach  and  Westall  1981;  Banerjee  et  al . 
1985;  Bouchard  et  al .  1988a;  and  Stauffer  et  al .  1989).  Little  or  no 
natural  retention  reduces  local  constituent  residence  times  and  decreases 
the  opportunity  for  biotic  and  abiotic  processes  to  naturally  attenuate 
contaminants. 

Enhanced  subsurface  sorption  has  been  suggested  as  an  innovative 
groundwater  remediation  technique  (Burris  and  Antworth  1990  and  Hatfield 
et  al .  1991)  which  could  be  used  alone  or  in  conjunction  with  more  tradi¬ 
tional  remediation  technologies.  Sorption  zones  could  be  created  within 
the  aquifer.  Enhancing  the  sorption  capacity  of  the  porous  matrix  would 
retard  contaminant  movement  and  concentrate  pollutants  within  localized 
regions  of  the  aquifer.  One  or  more  of  these  sorption  zones  would 
constitute  a  subsurface  sorption  system  (SSS).  Fig  1.  illustrates  the 
conceptual  application  of  a  SSS  to  intercept  a  plume  of  contaminants 
released  into  a  surficial  aquifer. 

The  use  of  cationic  surfactants  to  increase  the  retardation  factors 
in  geologic  materials  has  been  the  focus  of  several  recent  investigations. 
Lee  et  al .  (1989)  produced  a  200  fold  increase  in  sorption  coefficients 
for  a  subsurface  soil  treated  with  an  organic  cation  (hexadecyltri- 
methylammonium) .  Burris  and  Antworth  (1990)  modified  aquifer  material 
with  cationic  surfactants  and  subsequently  increased  the  sorption 
coefficients  for  common  groundwater  contaminants  by  three  orders  of 
magnitude.  Effective  use  of  cationic  surfactants  is  highly  dependent  upon 
the  cation  exchange  capacity  (CEC)  of  the  aquifer  material.  Many  aquifer 
materials  have  \ow  CEC's,  thus  limiting  the  potential  application  of 
cationic  surfactants.  Many  of  the  cationic  surfactants  of  interest  have 
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biocidal  activity  that  may  limit  the  ability  to  couple  a  cationic 
surfactant  with  biodegradation.  Clearly,  other  methods  of  enhancing 
sorption  need  to  be  examined. 

A  suggested  alternative  to  organic  cations  would  be  a  nontoxic, 
nonaqueous  phase  liquid  (NAPL).  The  sorptive  capacity  of  subsurface 
materials  treated  with  a  sorbing  NAPL  (SNAPL)  would  depend  upon  the 
residual  SNAPL  saturation  and  the  contaminant  SNAPL-water  partition 
coefficient.  The  residual  saturation  of  SNAPL  is  not,  however,  limited  by 
the  available  CEC  of  the  soil,  as  is  the  case  with  cationic  surfactants. 

The  first  half  of  this  report  examines  the  transport  and  retention 
of  several  dissolved  organics  in  a  soil  partially  saturated  with  a  SNAPL 
(decane).  Miscible  displacement  experiments  were  performed  with  water 
saturated  soil  columns  and  with  other  soil  columns  saturated  with  water 
and  residual  decane.  Organic  tracers  used  in  the  study  were  pentfloroben- 
zoic  acid  (PFBA),  benzene,  xylene,  PCE,  toluene,  and  1-methylnaphthalene. 
Three  existing  transport  models  were  used  to  simulate  the  premature  solute 
breakthrough,  asymmetry,  and  tailing  in  the  BTCs  caused  by  physical  and 
sorption  related  nonequilibrium.  Nonequil ibri urn  model ing  is  of  interest, 
because  the  conditions  giving  rise  to  nonequil  ibri  urn  sorption  may  result 
in  premature  breakthrough  of  contaminants  through  the  sorption  system.  It 
is  therefore  necessary  to  investigate  conditions  causing  nonequilibrium 
and  to  identify  suitable  models  to  predict  premature  failure  of  the 
sorption  system.  The  second  half  of  the  report  is  devoted  to  predicting 
the  performance  of  a  SSS  created  to  enhance  the  contaminant  retardation 
capabilities  of  an  aquifer.  The  concept  of  'effective  retardation'  is 
introduced  as  an  aid  to  predict  system  performance  and  to  facilitate 
system  design.  An  analysis  develops  the  necessary  equations  for 
estimating  the  performance  of  a  cylindrical  sorption  system,  while  an 
application  demonstrates  the  utility  of  the  effective  retardation  concept. 

2.0  OBJECTIVES 

The  goal  of  this  research  effort  was  to  develop  a  knowledge  base 
useful  in  the  design  of  sorption  systems  and  the  prediction  of  system 
performance  in  the  field.  The  initial  objectives  specified;  1)  a  verifi- 
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cation  effort  for  a  3-dimensional  immiscible  flow  model,  2)  bench-scale 
studies  to  generate  data  for  model  verification,  and  3)  a  series  of 
miscible  displacement  experiments  with  xylene  and  1-methylnaphthalene, 
which  would  provide  data  needed  to  predict  sorptive  system  capacity  and 
performance. 

Originally,  it  was  presumed  that  direct  injection  of  the  sorbing 
NAPL  (SNAPL)  was  the  best  way  to  install  a  sorption  zone.  Direct  injec¬ 
tion,  however,  is  not  likely  to  create  a  uniform  distribution  of  SNAPL  in 
the  porous  media.  Without  knowing  the  spatial  distribution  of  SNAPL,  it 
would  be  difficult  to  predict  sorption  system  performance;  this  is  because 
distribution  of  SNAPL  affects  sorption  capacity  and  contaminant 
velocities.  It  was  then  surmised  that  a  verified  3-dimensional  immiscible 
flow  model  would  be  needed  to  predict  residual  SNAPL  saturations  and  pore 
velocities  throughout  the  sorption  zone;  and  it  was  this  need,  that 
justified  the  research  objectives  leading  to  the  verification  of  an  immis¬ 
cible  flow  model.  Early  in  the  project,  however,  a  review  of  ground 
modification  techniques,  revealed  that  there  are  desirable  alternatives  to 
direct  injection.  One  alternative  was  to  use  jet  grouting  to  create 
sorption  zones  of  desired  dimensions.  Jet  grouting  removes  the  subsurface 
soils  without  surface  excavation.  Subsurface  soil  are  then  replaced  with 
a  substitute  material.  Jet  grouting  offers  control  over  sorption  zone 
dimensions,  permeability,  and  the  residual  SNAPL  distribution.  Clearly, 
with  an  emplacement  technique  such  as  jet  grouting,  model  predictions  of 
residual  SNAPL  saturations  are  no  longer  necessary;  hence,  there  was  no 
longer  a  need  to  verify  an  immiscible  flow  model.  On-the-other-hand, 
predictions  of  pore  water  velocities  inside  and  around  the  sorption  system 
are  still  needed  to  predict  system  performance. 

Given  the  availability  of  an  emplacement  technique  that  nullifies 
initial  justifications  for  verifying  an  immiscible  flow  model,  project 
objectives  were  amended  in  consultation  with  the  project  officer.  The 
amended  objectives  were  formulated  with  careful  attention  given  to  pre¬ 
serve  the  original  goal  of  this  research  effort;  that  is,  to  develop  a 
knowledge  base  useful  in  the  design  of  sorption  systems  and  the  prediction 
of  system  performance  in  the  field.  The  amended  objectives  also  reflect 
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additional  miscible  displacement  experiments.  Early  research  efforts  were 
quite  successful  in  column  breakthrough  experiments.  After  consultation 
with  the  project  officer,  it  was  agreed  that  additional  experiments  were 
necessary  and  in  the  best  interest  of  meeting  the  project  goal.  The 
finalized  objectives  of  this  research  effort  were: 

1)  to  perform  column  breakthrough  experiments  on  Xylene,  1- 
Methylnaphthalene,  Benzene,  Toluene,  and  PCE  which  deliver 
data  needed  to  predict  sorption  system  capacity  and  perfor¬ 
mance  for  these  contaminants, 

2)  to  develop  general  analytical  expressions  which  can  define  the 
velocity  field  and  predict  the  performance  of  subsurface 
sorption  systems  of  cylindrical  geometry. 


3.0  MISCIBLE  DISPLACEMENT  EXPERIMENTS 

3.1  THE0.1Y 

3.1.1  Model  1.  Two  Region  Noneouil ibrium  Model 

Premature  contaminant  breakthrough,  asymmetry,  and  tailing  in  BTCs 
for  hydrophobic  contaminants  may  be  due  to  large  heterogeneities  in  pore 
water  velocities.  The  velocity  variations  create  a  physical  nonequili¬ 
brium  condition  that  affects  both  sorbing  and  nonsorbing  solutes.  The  two 
region  nonequilibrium  model,  first  proposed  by  Coats  and  Smith  (1964)  for 
nonreactive  solutes,  presumes  that  the  complex  saturated  media  may  be 
modeled  as  a  two  region  domain.  In  the  first  region  the  fluid  is  mobile, 
which  is  where  convective  dispersive  solute  transport  occurs.  In  the 
second  region,  the  fluid  is  relatively  stagnate  or  immobile.  The  'mobile' 
and  'immobile'  regions  are  connected;  but,  rate-limited  diffusion  controls 
solute  exchange  between  the  mobile  and  immobile  regions.  Van  Genuchten 
and  Wierenga  (1976)  extended  the  nonreactive  contaminant  model  to 
incorporate  linear  sorption.  The  governing  equation  describing  transport 
in  the  mobile  region  is 
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Transport  in  the  immobile  region  is  given  by 


e,.  -jf  *  <1  -  p  -jf  =  *i<c.  -  c,j  (2) 

in  which  and  are  respectively  the  dissolved  and  sorbed  solute 
concentrations  in  the  mobile  region,  and  c.  and  S  are  the  corresponding 
dissolved  and  sorbed  phase  concentrations  in  the  immobile  region.  The 
bulk  density  of  the  porous  media  is  p.  The  total  volumetric  water  content 
of  the  saturated  media,  0,  is  the  summation  of  volumetric  water  content 
for  mobile  and  immobile  regions,  0^  and  0^^,  respectively.  The  mass  frac¬ 
tion  of  sorption  sites  associated  with  the  mobile  fluid  is  f.  Advecti /e 
and  dispersive  transport  components,  that  are  pertinent  to  the  mobile 
region,  appear  on  the  right  side  of  Eq.  1  and  are  expressed  in  terms  of 
v^,  the  average  mobile  pore-water  velocity  and  D^,  the  apparent  dispersion 
coefficient.  Finally,  the  exchange  of  solutes  between  the  mobile  and 
immobile  liquid  regions  is  described  using  a  first-order  mass  transfer 
coefficient, 

Sorption  in  both  the  mobile  and  immobile  regions  of  the  solid  matrix 
is  assumed  to  be  linear,  revet^sioie  and  instantaneous.  Therefore,  in  each 
region  the  sorbed  contaminant  conccn' nation  i« 

5.  =  {3a) 
and 


5,™  =  (3b) 

where  is  the  distribution  coefficient  for  linear  adsorption.  The  time 
rate  of  change  of  sorbed  concentrations  is 


ac, 

at  ■  ^  at 


(4a) 


and 
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{4b) 


^<=‘~dr 


The  results  presented  herein  are  from  one  dimensional  miscible 
displacement  experiments,  for  which  a  pulse  of  contaminant  is  introduced 
over  a  time  period  t^.  In  experiments,  where  tran;  ort  related  nonequi¬ 
librium  was  considered,  system  behavior  can  be  approximated  by  solving 
Eqs.  i  and  Z  under  the  following  initial  and  boundary  conditions 


c^ix.o)  =  C^^ix.o)  =  0 


S^ix.o)  =  S^^ix.o)  =  0 


5c 

-n  _ m  +  V  c 

"  dx  " 


V  c 


0  <  t  ^  t. 


-D„  +  V„C„  =  0 

"  dx  ”  v.o 


t  >  t. 


dx  \x^L 


t  >  0 


For  miscible  displacement  experiments  conducted  with  soil  columns  of 
length,  L,  a  convenient  dimensionless  expression  for  the  governing 

transport  equations  is  the  following 

o  5C  ,  dC,  1  d^C,  da 

*  (1  -  fi)R^  ,  1-^-  ^  (6) 

for  the  mobile  region  and 


(1  -  P)^-^  =  0)(C,  -  C,) 
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for  the  immobile  region,  where  the  dimensionless  variables  are  defined 


=  cjc. 


C2  =  CJC, 


/n  m 

lQ 


X  =  - 


X 

L 


P  = 


i?  =  1  + 


0 


P  = 


0) 


Qm  ^  -^P^d 
0  +  piC^ 


k^L 


(8) 


(9) 


(10) 


(11) 


(12) 


(13) 


(14) 


Eqs.  6  and  7  were  derived  from  Eqs.  1  and  2,  using  Eq.  4  to  eliminate 
and  s.^;  and,  then  using  Eqs.  8  through  14  to  substitute  dimensionless 

variables  for  dimensional  variables.  In  a  similar  fashion,  initial  and 

boundary  conditions  were  rewritten  below  in  terms  of  the  dimensionless 
variables 

0)  =  0  (15a) 

c^iX.Q)  =  0  (15b) 
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,Y=0 


1 


(15c) 


1  ^ 

P  dX 


+  C, 


0  <  T  ^ 


1 

P 


aq 

"ax 


+ 


^■*0 


0 


T>  q 


(15d) 


aq 

~dx 


x-i 


0 


r  >  0 


(ISe) 


3.1.2  Model  2.  The  Two  Site  Model 

Early  solute  arrival,  observed  asymmetry,  and  tailing  in  BTCs  may 
also  occur  as  a  result  of  heterogeneities  in  sorption  sites.  Differences 
in  the  rates  at  which  solutes  sorb  creates  a  sorption  related  nonequilib¬ 
rium  condition  affecting  only  sorbing  solutes.  The  two  site  nonequilib¬ 
rium  model,  proposed  by  Selim  et  al .  (1976)  and  Cameron  and  Klute  (1977), 
presumes  that  the  complex  saturated  media  may  be  treated  as  a  two  site 
sorption  domain;  sorption  on  type-1  sites  is  instantaneous,  while  sorption 
on  type-2  sites  is  time-dependent.  Letting  F  equal  the  mass  fraction  of 
sorption  capacity  associated  with  type-1  sites,  the  governing  equations 
for  the  transport  model  are 


dC  dSy  ^  32p  3/- 

dt  dt  dC  dx^ 


(16) 


and 

^  [(1  -  F)  K^C  -  q]  (17) 

where  C  is  the  dissolved  solute  concentrations,  Sj  is  the  type-1  sorbed 
solute  concentration,  and  is  the  type-2  sorbed  solute  concentration. 

represents  a  first-order  kinetic  rate  coefficient  used  to  describe  the 
rate  limited  sorption  at  type-2  sites.  Since  it  is  assumed  that  all  the 
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fluid  is  moving,  D  and  V  are  taken  to  be  apparent  dispersion  coefficient 
and  pore  velocity  of  the  total  liquid  phase,  respectively. 

Sorption  for  both  sites  is  assumed  to  be  linear  and  reversible; 
thus,  at  equilibrium  sorption  at  both  sites  is  given  by 

5,  =  FK^C  (18) 

and 


52  =  (1  -  F)  K^C 


(19) 


The  time  dependent  change  in  the  sorbed  solute  concentrations  at  type-1 
sites  is 


“aF 


r  be 

^  bt 


(20) 


whereas  rate  limited  exchange  of  solutes  at  type-2  sites  is  expressed  in 
Eq.  17. 

BTCs  are  from  one  dimensional  miscible  displacement  experiments,  for 
which  a  pulse  of  contaminant  is  introduced  over  a  time  period  t^.  The 
appropriate  initial  and  boundary  conditions  under  which  Eqs.  17  and  18 
should  be  solved  are 


C(x,0) 

=  0 

(21a) 

52  (X,  0) 

=  0 

(21b) 

-D  + 

bx 

VC  =  vc^ 

x»0 

0  <  t  ^  to 

(21c) 

-D  4^  + 

bx 

t^Cj  =  0 

1  X'O 

t  >  to 

(21d) 
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t  >  0 


(21e) 


The  transport  model  and  the  associated  initial  and  boundary  condi¬ 
tions  can  be  reduced  to  the  same  dimensionless  equations  derived  for  the 
two  region  model  (Eqs.  6,  7,  and  15).  The  dimensionless  model  is  obtained 
using  the  following  definitions  for  dimensionless  variables 


q  = 


^2  = 


il  -  F) 


/?  =  1  + 


9  ^  FpK^ 
e  *  pK^ 

k^il  -  p  )  /?L 


3.1.3  Model  3.  The  One  Site  Model 

In  this  nonequil ibrium  model ,  sorption  is  assumed  to  be  completely 
rate  limited,  and  it  is  approximated  with  first-order  kinetics;  thus,  the 
governing  equations  to  the  transport  model  are 


0 .  p|s 

dt  dt 


VQ  4^ 
ax 
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in  which  s  is  the  sorbed  species  concentration,  and  is  the  desorption 
rate  coefficient  for  a  linear  reversible  sorption  reaction.  To  approxi¬ 
mate  conditions  of  the  miscible  displacr  ent  experiments,  Eqs.  26  and  27 
should  be  solved  under  the  following  in,,.ial  and  boundary  conditions 


C(j!C,0)  =  0 


S{x,Q)  =  0 


-O  If  *  ’.'C  =  VC, 

ax  v.o 


■D  ^  +  VC  =0 

OX  x-0 


0  <  C  ^  t. 


t  >  t. 


t  >  0 


(30a) 


(30b) 


(30c) 


(30d) 


(30e) 


The  one-site  model  can  be  expressed  in  the  same  convenient  dimen¬ 
sionless  form  used  previously  with  models  1  and  2.  The  dimensionless 
variables  X,  2  ,  P,  and  R  are  defined  as  in  model  2;  however,  the  following 
dimensionless  variables  are  also  needed 


1 


Likewise,  with  appropriate  variable  substitutions,  the  initial  and  boun¬ 
dary  conditions  transform  to  the  same  dimensionless  form  obtained  with 
models  1  and  2.  By  expressing  model  3  in  the  same  dimensionless  form  as 
model  2,  it  becomes  evident  that  the  two  models  approach  equivalency  as  B 
approaches  the  reciprocal  of  the  retardation  factor.  B  equals  the  reci¬ 
procal  of  R  when,  F,  the  mass  fraction  of  instantaneous  sorption  sites  is 
zero.  It  is  also  possible  for  B  to  approach  a  value  of  1/i?  if  F  is  less- 
than-or-equal  to  the  reciprocal  of  R  and  the  porous  media  has  sufficiently 
high  sorptive  capacity. 

3.2  MATERIALS  and  METHODS 
3.2.1  Column  Experiments 

Breakthrough  curves  (BTCs)  of  pentafluorobenzoic  acid  (PFBA),  ben¬ 
zene,  xylene,  PCE,  toluene,  and  1 -methyl naphthalene  were  measured  during 
aqueous  elution  through  soil  columns  totally  saturated  with  water  and 
through  soil  columns  containing  water  and  a  residual  decane  saturation. 
Fig.  2  shows  a  schematic  of  the  experimental  setup.  The  soil  used  was  a 
fine  sand.  A  sieve  analysis  on  the  sand  determined  the  effective  size  and 
uniformity  coefficient  to  be  0.24  mm  and  1.67,  respectively.  The  density 
of  the  sand  grains  was  estimated  at  2.68  g/cm^.  The  organic  mass  fraction 
of  the  sand  was  determined  using  a  LECO  112  carbon  determinator  and  found 
to  be  0.02  percent. 

Soil  was  packed  in  stainless  steel  columns  with  an  inside  diameter 
of  2.12  cm  and  with  lengths  ranging  from  6.78  to  7.11  cm.  All  valves  and 
tubing  were  stainless  steel.  The  soil  columns  were  saturated  with  a 
solution  of  0.01  N  CaCl^  containing  10  mg/1  mercuric  chloride.  Soil 
columns  were  initially  exposed  to  several  column  pore  volumes  of  the  CaCl^ 
solution  until  saturated.  An  aqueous  solution  of  100.0  mg/1  PFBA  and  .01 
N  CaCl^  was  used  to  characterize  pore  water  velocities,  column  void  vol¬ 
umes,  column  Peclet  numbers,  and  natural  soil  dispersivities.  Next,  a 
decane  residual  was  emplaced  on  the  soil  matrix  using  a  procedure  of  first 
eluting  the  columns  with  several  pore  volumes  of  decane,  followed  by 
several  pore  volumes  of  the  CaCl^  solution.  Column  elutions  with  the  CaCl^ 
solution  continued  until  no  residual  decane  was  detected  in  the  eluent. 
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Schematic  of  Experimental  System 


Again,  an  aqueous  solution  of  PFBA,  was  used  to  characterize  the  column 
void  volumes,  pore  water  velocities,  Peclet  numbers  and  dispersivities  in 
the  soil-SNAPL  columns.  Table  1  summarizes  the  soil  column  charac¬ 
teristics  obtained  by  measurement  and  through  modeling  of  PFBA  BTCs. 
Benzene,  toluene,  PCE,  xylene,  and  1-methylnaphthalene  were  individually 
eluted  as  dissolved  solutes  in  the  CaCl2  solution.  The  organic  solution 
concentrations  were  generally  less  than  50  percent  of  reported  aqueous- 
phase  solubilities. 

3.2.2  Saturation  Flask  Experiments  and  Residual  Decane  Determinations 
Decane-water  partition  coefficients  were  determined  for  each  reac¬ 
tive  tracer  used  in  the  study.  Following  the  approach  of  MacIntyre  and 
Stauffer  (1988),  saturation  flasks  containing  350  ml  of  distilled  water 
were  prepared  for  each  organic  tracer.  Next,  25  ml  of  a  solution,  con¬ 
taining  49.5  ml  decane  and  0.5  ml  of  tracer  (i.e.,  benzene,  toluene,  PCE, 
xylene,  or  1-methylnaphthalene),  was  carefully  added  to  each  flask  above 
the  water  surface.  Each  flask  was  allowed  to  sit  undisturbed  for  one 
week,  after  which  tracer  concentrations  were  analyzed  in  both  water  and 
decane  phases  using  GC.  Sampling  and  subsequent  analyses  were  repeated 
three  times  for  each  tracer.  The  decane-water  partition  coefficients  were 
calculated  using  the  following  equation 

^  _  [  {mass  of  tracer  in  decane)  /  ( total  mass  of  decane)  ]  ,^4) 
^  [  (mass  of  tracer  in  water)  /  (total  mass  of  water)  ] 

All  soil  columns  were  sacrificed  and  analyzed  for  decane  after  the 
completion  of  planned  miscible  displacement  experiments.  The  stainless 
steel  column  end  fittings  were  removed  and  the  sand-decane-water  contents 
were  carefully  sectioned  into  five  equal  samples.  The  decane  was  then 
extracted  from  the  samples  with  hexane.  The  hexane  extracts  were  analyzed 
by  GC  to  obtain  the  mass  of  decane  extracted  per  unit  mass  of  sand. 

3.2.3  Model  Parameter  Estimation  bv  Nonlinear  Least-squares  Inversion 
Transport  model  parameters  v,  v^,  d,  d^,  r,  B,  and  u  were  esti¬ 
mated  using  software  developed  by  Parker  and  van  Genuchten  (1984).  The 
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Table  1.  Basic  Soil  Column  Characteristics  Obtained  Through  PFBA  BTCs 
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Calculated  from  the  measured  density  of  sand  and  the  measured  residual  decane 
Calculated  from  results  of  fitting  model  1  to  PFBA  BTCs 


software  uses  nonlinear  least-squares  inversion  to  identify  parameter 
values  that  minimize  the  sum  deviations  between  analytical  model  estimates 
(i.e.,  model  1,  2  or  3)  and  observed  solute  concentrations.  The  software 
was  originally  developed  to  fit  models  one  and  two;  however,  parameter 
values  for  model  3  can  be  identified  using  this  software  if  Cp  Cp  IB,  and 
(i)  are  defined  as  specified  in  Eqs.  31  through  33. 

PFBA  has  been  used  successfully  in  the  past  as  a  nonreactive  tracer; 
thus,  asymmetry  or  tailing  in  PFBA  BTC  would  indicate  the  presence  of 
transport  related  nonequilibrium  conditions.  Model  1  was  used  to  simulate 
BTCs  for  PFBA,  which  in  turn,  identified  characteristic  pore  water 
velocities,  dispersion  coefficients,  and  Peclet  numbers  for  each  soil 
column.  In  subsequent  efforts  to  identify  model  2  parameters  for  reactive 
tracers,  column  Peclet  numbers,  from  corresponding  PFBA  experiments,  were 
used  in  the  model  fitting  process  but  not  allowed  to  vary;  this  left 
parameters  R,  T^,  B,  and  u  to  vary. 

Initial  estimates  of  parameters  are  needed  to  begin  the  iterative 
search  for  values  giving  the  best  mode!  fit.  First  estimates  were 
obtained  using  solute  concentration  moments  from  BTCs.  Using  the  same 
notation  as  Valocchi  (1985),  the  nth  absolute  temporal  moment  for  the  BTC 
is  defined  as 


=  r  T^C^iX.T)  dT 
J  O 


(35) 


in  which  T  and  X  are  the  dimensionless  time  and  distance  variables  defined 
previously.  Having  estimates  of  the  soil  column  porosity  and  the  rate  at 
which  water  is  eluted  through  the  column,  the  zero  absolute  moment  gives 
the  total  mass  of  contaminant  injected  per  unit  of  column  void  volume. 
When  working  with  dimensionless  concentrations,  the  zero  moment  gives  the 
length  of  the  injection  pulse,  T^,  expressed  in  column  pore  volumes.  The 
nth  normalized  absolute  moment  is  expressed  as 

r  T"  q  {X,  T)  dT 

J  O 


Hn  = 


r  Cl  (X,  T)  dT 

^  O 


(36) 


25-19 


The  first  normalized  moment  of  the  BTC  represent  the  time  on  the  BTC  where 
half  the  solute  mass  has  been  eluted  from  the  column.  The  retardation 
factor,  K,  was  obtained  from  the  first  normalized  moment.  The  nth  central 
moment  is  defined  as 


= 


r  (T  -  p,)"  C^(X,T)  dT 

J  o _ 

r  q  (X,  T)  dT 
J  o 


The  second  and  third  central  moments  were  used  to  obtain  estimates  of  (3 
and  0).  Valocchi  (1985)  gives  moment  equations  for  dirac  contaminant 
loads.  Used  in  this  study,  but  derived  from  Valocchi 's  equations,  are 
moment  equations  appropriate  for  pulse  contaminant  injections  in  one 
dimensional  miscible  displacement  experiments  (see  Table  2). 


3.3  RESULTS  and  DISCUSSION 

3.3.1  Nonreactive  Tracer  Displacement 

PFBA  was  used  as  a  nonreactive  tracer  to  characterize  the  physical 
transport  parameters  in  soil  columns  both  before  and  after  residual  decane 
emplacement.  Table  3  summarizes  the  optimum  parameter  values  obtained 
from  fitting  model  1  to  the  BTCs  of  PFBA.  For  several  soil  columns, 
miscible  displacement  experiments  were  conducted  over  a  range  of  fluxes  in 
order  to  observe  model  parameter  dependency  on  pore  water  velocities. 
Fig.  3  shows  typical  BTCs  for  PFBA  from  a  column  before  and  after  an 
emplaced  decane  residual.  This  figure  shows  the  accelerated  arrival  of 
the  tracer  after  a  decane  residual  was  created  in  the  soil  column.  Under 
the  same  fluid  flux,  the  presence  of  residual  decane  increased  pore  water 
velocities  and  dispersivities  however,  column  Peclet  numbers 
consistently  decreased  after  decane  Tlacement.  Decane  saturation,  0^, 
did  not  appear  to  have  any  consiste  effect  on  the  fraction  of  mobile 
water.  Values  of  (3  were  less  than  one  but  generally  greater  than  0.90. 

An  analysis  was  performed  to  evaluate  the  sensitivity  of  the  general 
dimensionless  model  under  various  values  of  (3  and  u.  The  analysis  lead  to 
the  derivation  of  the  following  equation 
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Table  2.  Time  Moment  Formulas  for  Pulse  Input 


Moment 


Model 

1,  2  and  3 
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Table  3.  Optimum  Parameter  Values  for  Model  1  Fitted  to  Observed  BTCs  of  PFBA 
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BTC  EXPERIMENTS  9  AND  10 


Fig.  3  PFBA  BTCs  before  (+  observed,  ---  fitted)  and  after  (□  observed, 
—  fitted)  Emplacement  of  Decane  Residual  on  Soil  Column  No.  3. 


25-23 


which  describes  the  sensitivity  of  the  general  dimensionless  model  to 
small  changes  in  the  relative  values  of  B  and  u.  According  to  Eq.  38,  as 
6  values  approach  unity  the  transport  model  becomes  less  responsive  to 
changes  in  the  value  of  u.  This  conclusion  was  verified  through  simula¬ 
tion.  Fig  4  shows  how  the  model  responds  to  relative  changes  in  u,  if  the 
standard  u  is  2.0.  As  the  true  B  approaches  unity,  the  response  curves 
become  flatter.  The  flat  response  curves  could  make  some  u  determinations 
less  reliable  since  a  model  fitting  procedure  was  used.  Table  3  lists 
several  fitted  values  of  u  that  may  be  suspect  because  B  values  were  close 
to  1.0. 

3.3.2  Reactive  Tracer  Displacement 

Table  4  summarizes  miscible  displacement  experiments  conducted  with 
reactive  solutes,  the  columns  used,  whether  residual  decs  was  present, 
the  mode  of  tracer  detection,  the  final  models  used  to  simulate  BTCs,  and 
the  fitted  model  parameters.  Looking  first,  at  reactive  tracer  behavior 
in  the  natural  soil  (experiments  18  through  32),  the  retardation  factors 
were  generally  less  than  1.5  with  the  exception  of  1-methylnaphthalene. 
For  benzene  and  toluene,  B  values  were  greater  than  .90.  Values  of  B 
decreased  with  increasing  tracer  hydrophobicity,  while  fitted  values  of  u 
consistently  increased  (see  Table  4,  experiments  18  through  32).  Fig.  5 
shows  typical  BTCs  for  benzene  and  toluene  from  a  column  before  treatment 
with  decane.  Given  the  small  fraction  of  immobile  water  found  from  the 
PFBA  BTCs,  and  the  increasing  asymmetry  seen  in  the  BTCs  of  less  polar 
tracers,  it  was  assumed  that  the  observed  nonequilibrium  was  sorption 
related.  Thus,  model  2  was  selected  to  approximate  reactive  solute 
transport  through  the  water  saturated  soil  columns.  Changes  in  the  fluid 
velocity  did  not  produce  consistent  variations  in  u  values.  The  value  of 
F  was  estimated  by  regressing  the  product  e*i?*B  against  e*(i?-l)  (see  Fig. 
6).  The  regression  equation  indicates  that  33  percent  of  the  sorption 
sites  were  associated  with  instantaneous  sorption.  Another  correlation 


DI^^ESIONLESS  MODEL  SENESITIVITY 

TO  OMEGA 


SUM  SQUARE  OF  DEVIATIONS 


Fig.  4  Dimensionless  Model  Sensitivity  to  w  under  Specified  Values  of  jS 
(□,  p  =  .99;  *,  P  =  .90;  and  x,  /?  =  .70). 
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Table  4.  Optimum  Parameter  Values  foi  Models  1  and  3  Fitted 
to  Observed  BTCs  for  Reactive  Organic  Tracers 
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No,  Y  =  Yes,  UV  =  Ultra  Violet,  SC  =  Scintillation  Counter,  GC  =  Gas  Chromotography, 
Benzene,  T  =  Toluene,  X  =  Xylene,  P  =  PCE,  M  =  1 -methyl naphthalene 


BTC  EXPERIMENTS  18  AND  21 


C/CO 


PORE  VOLUMES 


Fig.  5  BTCs  of  Benzene  (□  observed,  —  fitted)  and  Toluene  (+  observed, 
---  fitted)  before  Emplacement  of  Decane  Residual  on  Soil  Column 
No.  1. 
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THETAMR-1) 

Regression  of  0R/3  against  0(R-1)  for  all  Reactive  and 
Nonreactive  Tracers  before  Emplacement  of  Decane 
Residual  on  Soil  Column  No.  1. 


was  sought  between  logarithmic  values  of  the  first  order  rate  coefficient 
and  the  sorption  coefficient,  Log  linear  relationships  were  visi¬ 
ble  for  xylene,  PCE,  .d  1 -methyl naphthalene  where  experiments  were  con¬ 
ducted  at  the  same  flux  (see  Fig.  7).  Poor  correlations  were  obtained  for 
toluene  and  benzene  regardless  of  the  flux.  In  light  of  the  B-u  relation¬ 
ship  defined  by  Eq.  38,  the  high  6  values  probably  precluded  suitable 
identification  of  u  in  experiments  18  through  23. 

The  emplacement  of  residual  decane  on  the  soil  columns  increased 
tracer  retardation  factors  40  to  650  times.  Fig.  8  shows  BTCs  of  benzene 
and  toluene  after  treating  a  soil  column  with  decane;  this  was  the  same 
experimental  column  used  previously  to  produce  the  BTCs  depicted  in  Fig. 
5  for  decane  free  soil. 

Initial  efforts  to  use  model  2  to  predict  BTCs  were  not  successful. 
In  general,  the  fitting  program  forced  6  to  zero,  suggesting  instantaneous 
sorption  was  a  minor  component  of  the  total  sorptive  capacity  of  SNAPL- 
Soil  matrix.  This  indicates  that  solute  exchange  between  water  and  decane 
was  for  the  most  part  rate  limiting,  and  that  model  3  could  be  applicable. 
In  all  subsequent  model  fitting  efforts,  model  3  was  successfully  used  to 
predict  BTCs  for  reactive  tracers  from  soil  columns  containing  residual 
decane. 

Another  correlation  was  sought,  but  this  time  between  the  loga¬ 
rithmic  values  of  the  first  order  rate  coefficient,  ;c3,  and  the  sorption 
coefficient,  K^.  Log  linear  relationships  were  again  visible  between 
tracer  experiments  conducted  at  the  same  flux  (see  Fig.  9);  however,  an 
overall  correlation  was  taken.  Limited  data  precludes  development  of 
separate  regression  equations  for  each  fluid  flux;  nevertheless,  future 
efforts  to  develop  such  correlations  should  consider  an  array  of  experi¬ 
ments  using  a  range  of  fluid  fluxes  and  a  host  of  tracers  with  a  broad 
range  of  K^s. 

Explanations  were  sought  for  why  model  3  worked  on  the  soils  con¬ 
taining  residual  decane.  One  reason,  could  be  that  most  of  the  sorption 
capacity  was  located  in  randomly  distributed  but  isolated  droplets  of 
SNAPL.  If  this  were  true,  it  could  be  argued  for  low  organic  soils,  that 
the  fraction  of  instantaneous  sorption  sites  was  essentially  zero.  When 


25-29 


Fig.  7  First  Order  Rate  Coefficient,  versus  Soil  Sorption  Coefficient, 
before  Emplacement  of  Decane  Residual  on  Soil  Column  No.  1 
(B  =  benzene,  T  =  toluene,  X  =  xylene,  P  =  PCE,  and  M  =  1-methyl - 
naphthalene;  at  fluid  fluxes,  □  =  .1415  cm  min  ',  x  =  .283/cm 

min’^  and  +  =  .8492  cm  min’^). 
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BTC  EXPERIMENTS  33  AND  36 


C/CO 


PORE  VOLUMES 


Fig.  8  BTCs  of  Benzene  (□  observed,  —  fitted)  and  Toluene 
(+  observed,  ---  fitted)  after  Emplacement  of  Oecane 
Residual  on  Soil  Column  No.  2. 
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1.0 


10.0 


100.0 


1000.0 


Kd 

Fig.  9  First  Order  Rate  Coefficient,  versus  SNAPL-Soil  Sorption 
Coefficient,  after  Emplacement  of  Decane  Residual.  Exp. 
Nos.  33  to  42  for  Fluid  Fluxes  (x  =  .1416  cm  min'\  +  =  .2831 
cm  min'^  and  □  =  .8492  cm  min'V 
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F  is  zero,  model  2  reduces  to  model  3  and  B  equals  l/R.  This  explanation 
is  plausible,  if  tracer  exchange  is  restricted  by  the  available  area 
through  which  solutes  pass  between  the  aqueous  and  organic  phases. 

In  the  preparation  of  soil  columns,  decane  was  eluted  through  soils 
which  had  been  previously  saturated  with  water.  The  elution  of  decane 
entrapped  water  in  the  smaller  pores.  In  a  similar  manner,  when  water  was 
subsequently  used  to  displace  excess  decane,  a  residual  organic  phase 
could  have  been  entrapped  as  semi-isolated  droplets  between  grains  of 
sand. 

Another  possible  explanation  for  the  success  of  model  3,  could  be 
that  F  is  not  zcio,  rather,  the  sorption  capacity  of  the  SNAPL-soil  matrix 
is  so  large,  that  B  (from  model  2)  approaches  the  value  of  F.  If  the 
value  of  F  is  less-than-or-equal -to  l/R,  then  predictions  from  model  2 
would  either  equal  or  closely  resemble  predictions  from  model  3. 

3.4  INDEPENDENT  MODEL  PARAMETER  ESTIMATION 

It  is  often  the  case  with  modeling  that  parameters  values  are  needed 
which  cannot  be  measured  conveniently  or  directly;  consequently,  it  is 
desirable  to  develop  a  capability  of  estimating  model  parameters 
independently  of  experiment.  In  this  study,  B,  u,  p,  and  R  were  the  most 
important  model  parameters  for  reactive  tracers.  In  experiments  with  soil 
columns  partially  saturated  with  decane,  sorption  had  such  a  dominant 
influence  on  the  shape  of  BTC,  that  it  masked  the  influence  of  the  Peclet 
number. 

In  this  study  an  effort  was  made  to  estimate  B,  u,  and  R  indepen¬ 
dently;  however,  the  greatest  success  was  with  the  estimation  of  R  and  B. 
Table  5  shows  the  values  of  R  estimated  by  two  independent  methods,  and 
my  moments  analysis.  Both  of  the  independent  methods  used  measured 
decane-water  partition  coefficients  and  sorption  coefficients  for  the 
sand.  The  methods  differ  in  the  manner  by  which  the  available  void  volume 
and  the  residual  decane  is  estimated.  In  method  1,  the  residual  decane 
and  the  column  void  volume  are  determined  from  the  estimated  pore  velo¬ 
cities  given  by  the  PFBA  BTCs.  Method  2  makes  use  of  the  column  void 
volumes  and  decane  residuals  obtained  through  measured  soil  densities  and 
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Table  5.  Optimum  Fitted  Retardation  Factors  versus  Independent  Estimates 
and  Calculated  Retardation  Factors  from  Moments 
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direct  measurement  of  residual  decane.  Once  the  decane  residual  and  the 
column  void  volume  have  been  estimated,  the  calculation  of  R  is  the  same 
between  the  two  methods.  When  a  comparison  is  drawn  between  indcjeddc:.  j 
derived  Rs  and  those  obtained  through  model  fit,  it  may  be  concluded  that 
both  independent  methods  are  good  estimators.  Moment  estimates  were  also 
excellent;  however,  this  method  requires  an  experimental  that 
independent  methods  avoid. 

The  successful  estimation  of  retardation  factors  meant  that  good 
estimates  of  (3  were  likely;  this  was  because  the  BTCs  from  experiments  33 
to  42  were  easily  modeled  as  one  site  systems.  Recall  that  in  model  3,  B 
is  estimated  simply  from  the  reciprocal  of  the  retardation  factor.  Table 
6  shows  independently  estimated  B  values  for  model  3  as  well  as  those 
obtained  through  model  fitting;  again  a  comparison  will  verify  that 
methods  of  independent  determination  are  quite  reliable. 

When  B  values  are  as  small  as  those  reported  in  table  6,  it  is  evi¬ 
dent  from  Eq.  38,  that  any  model  response  to  a  change  in  this  parameter  is 
proportional  to  the  value  of  B.  Given  in  table  6  are  B  values  calculated 
as  if  model  2  was  being  used.  It  should  be  noted  that  B  values  calculated 
for  model  2  are  not  that  different  from  the  B  estimates  for  model  3;  this 
is  because,  large  relative  changes  in  this  parameter  will  have  little 
influence  on  simulated  results  if  the  true  value  of  B  is  small. 

4.0  PREDICTING  SORPTION  SYSTEM  PERFORMANCE  THROUGH  EFFECTIVE  RETARDATION 
4.1  THEORY 

Contaminant  fluxes  can  be  estimated  from  pollutant  concentrations 
and  the  velocity  of  the  groundwater  flow.  Contaminant  flux  reductions 
achieved  by  an  SSS  depend  on  the  spatial  distribution  of  SNAPL  which 
controls  spatial  variations  in  the  sorptive  capacity  and  permeability  of 
the  soil.  Designing  an  SSS  for  the  field  is  complicated  because  several 
parameters  affecting  system  performance  require  simultaneous  considera¬ 
tion.  The  most  salient  parameters  are:  the  residual  SNAPL  saturation,  the 
permeability  of  the  system  and  the  surrounding  aquifer,  the  geometrical 
configuration  of  the  SSS,  and  the  sorption  characteristics  of  the  natural 
aquifer  and  the  sand/SNAPL  matrix.  The  complexity  of  the  problem  is 
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Using  PFBA  BTCs,  measured  KpS  from  saturation  flasks,  and  natural  soil  K^^s 

Using  measured  sand  densities,  decane  residuals  K  s  from  saturation  flasks,  and  natural  soil  K^^s 


reduced,  however,  if  remediation  benefits  obtained  from  an  SSS  are 
expressed  in  terms  of  the  'effective  retardation'.  The  effective 
retardation  is  defined  here  as  the  dimensionless  ratio  of  contaminant 
velocity  under  ambient  conditions,  to  the  new  velocity  created  with  the 
installation  of  the  SSS.  The  effective  retardation  is  a  spatially  varying 
parameter  of  the  groundwater  flow  domain;  it  is  the  factor  that  translates 
ambient  contaminant  velocities  to  what  they  will  be  once  the  sorption 
system  is  installed. 

General  analytical  expressions  for  the  effective  retardation  at  all 
points  within  a  flow  domain  are  derived  for  the  hypothetical  sorption 
system  illustrated  in  Fig.  10;  this  figure  shows  a  circular  sorption  field 
centered  over  the  origin  of  an  orthogonal  pair  of  axes.  If  a  profile  view 
were  given,  it  would  verify  that  the  system  is  actually  a  vertical 
cylinder  having  a  height  equal  to  the  thickness  of  the  saturated  zone  and 
a  horizontal  radial  dimension  i?^.  Inside  the  SSS,  the  porous  media  has 
homogeneous  sorptive  characteristics,  porosity,  and  permeability.  The 
aquifer  surrounding  the  sorption  system  is  also  homogeneous;  however,  the 
sorption  characteristics,  porosity,  and  permeability  of  the  porous  media 
are  different  from  that  of  the  soil-SNAPL  matrix  in  the  SSS. 

Derivation  of  the  effective  retardation  functions  for  the  cylindri¬ 
cal  system  involves  elementary  manipulations  with  complex  numbers.  For 
convenience  the  location  of  the  SSS  is  centered  over  the  origin  of  complex 
coordinate  plane  (see  Fig.  10).  The  region  inside  the  boundaries  of  the 
system  will  be  referred  to  as  the  SSS,  the  flow  domain  surrounding  SSS 
will  be  designated  simply  the  surrounding  aquifer  (SA). 

The  derivation  of  the  effective  retardation  equations  begins  with  an 
assumption  that  transport  is  primarily  advective  (i.e.,  dispersive  tran¬ 
sport  is  insignificant).  Next,  an  allowance  is  made  that  contaminant 
sorption  to  the  soil  and  SNAPL  is  linear,  reversible,  ana  at  equilibrium. 
With  these  two  initial  assumptions,  the  transport  of  a  contaminant  is 
given  by  the  local  apparent  velocity 


q 

R  0 


(39) 
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Fig.  10  Plan  View  of  a  Hypothetical  Cylindrical  SSS  (Region  A) 
Possessing  Greater  Permeability  than  the  Surrounding 
Aquifer  (Region  B) . 
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where,  0  is  the  water  saturated  porosity  and  R  is  the  retardation  factor 
due  to  the  sorption  assumption.  The  retardation  factor  is  the  ratio  of 
the  average  linear  velocity  of  the  groundwater  to  the  velocity  of  a 
sorbing  contaminant,  g  is  the  resultant  Darcian  velocity  obtained  from 
discharge  components  in  the  horizontal  x  and  y  directions 

g  =  +  (40) 


The  direction  of  this  flow  is  0 
<p  =  Arctang 

The  retardation  factor  is  estimated  from 


(41) 


(42) 


in  which  p  is  the  bulk  density  of  the  permeable  matrix,  and  is  the 
matrix/water  sorption  coefficient  for  a  target  contaminant.  Though  other 
formulations  for  the  retardation  factor  exist  (Hamaker  1975),  Endfield 
(1982)  produced  a  lucid  derivation  of  Eq.  42  from  assumptions  that 
sorption  was  linear,  reversible,  and  instantaneous. 

Under  natural  soil  conditions,  the  retardation  factor  is  calculated 
by  letting  p  equal  p^  the  bulk  density  of  the  soil,  0^  be  the  total  soil 
porosity  0,  and  equal  the  soil/water  sorption  coefficient  for  a 
target  contaminant.  The  sorption  coefficient  may  be  obtained  experiment¬ 
ally  or  estimated  by  any  number  of  empirical  formulae;  however,  a  commonly 
used  equation  has  been  that  from  Karickhoff  (1979) 


K.  =  0.63 

as  ow  oc 


(43) 


where  is  the  octanol/water  partition  coefficient  for  the  target 
contaminant  and  is  the  fraction  of  organic  carbon  associated  with  the 
soil . 
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Eq.  42  applies  inside  the  SSS  as  well;  however,  values  for  0  and  R 
must  reflect  the  presence  of  the  SNAPL.  The  water  saturated  porosity  is 
obtained  from  knowing  the  SNAPL  filled  porosity  6^. 

e  =  03-0,  (44) 

The  retardation  factor  inside  the  SSS  is  calculated  from  Eq.  42,  using  the 
newly  determined  water  saturated  porosity  and  values  for  p  and  obtained 
from  the  following 

P  =  Ps  ^  ®dPd 


and 

^d 


(^ds  Ps  ^  ^pQd  Pd) 
(  Ps  ^  ®d  Pd) 


(46) 


in  which  p,  is  the  density  of  the  SNAPL  and  is  the  SNAPL/water  partition 
coefficient  for  a  targeted  contaminant. 

The  effective  retardation,  J?^,  at  a  location  in  the  flow  domain  is 
the  ratio  of  ambient  contaminant  velocity  to  the  velocity  after  the  SSS  is 
installed.  For  locations  inside  the  SSS  and  in  the  SA,  respectively,  the 
effective  retardations  are  expressed  as 


p  I 

■"e  I  SSS 

and 


V, 


CO 


1  SSS 


■^e  i  SA 
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^clsA 


(47a) 


(47b) 


where  is  the  ambient  contaminant  velocity,  V  L..  is  contaminant  velo- 
city  in  the  SSS,  and  is  the  contaminant  velocity  in  the  SA. 
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In  the  development  of  the  general  effective  retardation  equations  it 
is  assumed  that  groundwater  flow  is  uniform  and  parallel  to  the  x  axis  in 
our  hypothetical  problem;  hence,  q  is  initially  equated  to  for  the 
entire  flow  domain.  It  follows  from  this  assumption,  that  the  ambient 
contaminant  velocity  is  found  by  combining  Eqs.  39,  42,  and  43. 

po 


V. 


‘JO 
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+  0.63  K  F 
ou  oc 


(48) 


After  the  sorption  system  is  installed,  the  same  substitutions  described 
above  define  the  apparent  contaminant  velocity  in  the  SA  as 


V  I 

C  I  SA 


_ g _ 

10  +  0.63  K  F  p 

ow  oc 


(49) 


Finally,  the  apparent  contaminated  velocity  inside  the  SSS  is  obtained 
from  combining  Eqs.  39,  42,  44,  45  and  46. 


eJ  SSS 


0  -  0j  +  0.63  K 
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OW  OC 
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(50) 


Analytical  formulations  for  the  effective  retardation  inside  the  SSS 
and  in  the  SA  can  be  obtained  using  appropriate  expressions  for  the 
specific  discharge.  Strack  and  Haitjema  (1981)  provide  a  solution  to  a 
specific  groundwater  flow  problem  from  which  the  required  discharge 
expressions  can  be  obtained.  The  solution  is  for  steady  flow  in  a 
regional  two  dimensional  horizontal  aquifer,  that  has  a  cylindrical  homo¬ 
geneous  anomaly;  the  regional  aquifer  is  characterized  with  a  different 
hydraulic  conductivity  than  the  cylindrical  anomaly.  The  solution  incor¬ 
porates  far  field  boundary  conditions  of  uniform  flow  parallel  to  the  x 
axis  and  near  field  conditions  of  continuity  of  discharge  along  the 
boundary  separating  the  SSS  and  the  SA.  For  a  unit  thickness  of  aquifer, 
the  complex  potentials  for  groundwater  flow  in  the  SSS  and  the  SA  are 
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^SA  ^  ^SSS 


sss 


(51a) 


^SA  "  ^sss 


"  ~  ° 
^sss 


(51b) 


in  which  and  are  the  complex  potentials  inside  the  SSS  and  in  the 
SA,  respectively.  The  complex  variable  z  is  a  point  in  the  complex  plane 
of  which  the  origin  of  that  plane  is  coincident  with  the  center  location 
of  the  SSS  (see  Fig.  10).  The  variable  z  equals  x  +  iy,  in  which  x  and 
y  are  distance  variables  and  i  is  the  square  root  of  negative  one. 
is  the  permeability  of  the  SSS,  is  the  natural  permeability  of  the  SA, 
and  *0  is  the  product  of  initial  potentiometric  head  and  hydraulic  con¬ 
ductivity  inside  the  sorption  system. 

The  derivative  of  the  complex  potential  with  respect  to  variable  z 
can  be  used  to  obtain  the  horizontal  specific  discharge  components. 
Inside  the  sorption  system,  the  real  portion  of  the  derivative  of  Eq.  51a 


^sss  ^  -^SA 


(52a) 


The  imaginary  portion  is 


g,  =  0 


(52b) 


Eqs.  52a  and  52b  state  that  flow  inside  the  boundary  of  the  sorption  zone 
is  uniform  and  parallel  to  the  x  axis;  thus. 


^SS  "  ^SA 
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The  permeability  of  the  sorption  zone  is  a  function  of  the  residual 
saturation  of  SNAPL;  therefore,  is  the  product  of  the  relative  permea¬ 
bility  factor  k^,  and  the  natural  permeability  of  the  soil  k^,  used  to 
construct  the  sorption  field.  Relative  permeabilities  have  been  estimated 
previously  (Hatfield  et  al .  1991)  for  same  soil  used  in  the  miscible  dis¬ 
placement  experiments.  The  function  used  then,  to  make  estimates  was  from 
Rose  (1949). 


16sJ  (s  -  s  (1  -  s  ) 

W  '  u  wm'  '  urn' 


(54) 


[2S.  (2  -  3S)  +  2S  S  (3S 

w  \  wjT)/  w  wm  '  w 


2)  -  (4  -  5Sjr 


in  which  is  the  residual  saturation  of  water  when  decane/mineral  oil 
is  flowing  through  the  media,  and  is  the  ratio  of  water  saturated  poro¬ 
sity  to  total  porosity.  The  residual  saturation  of  water  for  the  media 
under  dynamic  conditions  was  obtained  from  column  experiments  in  which  the 
SNAPL  was  used  to  displace  water  in  a  fully  saturated  system.  Mass 
balance  calculations  gave  an  average  residual  water  saturation  of  0.246 
for  the  columns. 

By  combining  Eqs.  47a,  48,  50,  and  53  and  substituting  the  product 
k^k^  for  IC555  an  analytical  expression  is  obtained  for  the  effective  retar¬ 
dation  inside  the  boundaries  of  the  subsurface  sorption  system. 


p  I 
I  SSS 


(6  -  9,  .  0.63  P3  .  9,  p,)  (;c^  k,  .  k^^) 

2(9.  0.63  P3)  k^k^ 


(55) 


4.2  APPLICATION 

Data  from  previous  bench-scaled  experiments  (Hatfield  1989)  were 
used  to  create  hypothetical  plots  of  the  effective  retardation  inside  the 
SSS  (see  Fig.  11).  The  plots  cover  a  range  of  residual  SNAPL  saturations 
that  exceed  the  maximum  value  of  13.9  percent  observed  experimentally 
(Hatfield  1989).  It  is  expected  that  other  SNAPLs  are  likely  to  produce 
effective  retardations  curves  of  similar  shape,  but  may  not  be  limited  to 
the  range  of  residual  saturations  found  experimentally;  thus,  the  curves 
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EFFECTIVE  RETARDATION 
IN  THE  SORPTION  SYSTEM 


EFFECTIVE  RETARDATION 


RESIDUAL  SNARL  SATURATION 


Fig.  11  The  Effective  Retardation  in  a  Cylindrical  Hypothetical  SSS 
(Z  =  +  =  l-methylnaphtha1ene;  —  =  Rs^j/R^;  and 

0  =  Nonreactive  tracer). 
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presented  in  this  paper  serve  as  a  general  model  for  effective  retardation 
at  both  high  and  low  residual  SNAPL  saturations.  A  family  of  curves  for 
both  1-methylnaphthalene  and  a  nonreactive  contaminant  are  shown;  the  non¬ 
reactive  contaminant  is  any  dissolved  constituent  that  is  nonsorbing  and 
conservative.  Values  of  a  dimensionless  parameter  z,  appear  in  Fig.  11 
in  association  with  each  effective  retardation  curve.  Z  represents  the 
ratio  of  permeability  in  the  SSS  at  zero  residual  SNAPL  saturation  to  the 
natural  permeability  of  the  surrounding  aquifer. 

The  nonreactive  contaminant  curves  were  developed  using  Eq.  55. 
Both  sorption  parameters  Kp  and  were  set  to  zero,  and  Eq.  54  was  used 
in  conjunction  with  an  experimental  value  for  the  residual  water  satura¬ 
tion  to  define  the  relative  permeability  k^. 

From  the  nonreactive  curves,  effective  retardation  is  not  greatly 
affected  by  changes  in  residual  SNAPL  saturations  in  the  SSS.  The  small 
increases  in  the  pore  velocity,  caused  by  decreasing  water  filled  poro¬ 
sity,  are  being  canceled  by  a  slightly  greater  decreases  in  the  flow 
caused  by  decreasing  SSS  permeability.  The  parameter  which  is  most  impor¬ 
tant  is  the  permeability  ratio  z.  When  z  equaled  0.1,  the  permeability 
of  the  soil  in  the  SSS  was  10  times  less  than  that  of  the  surrounding 
aquifer;  streamlines  diverged  as  they  approached  the  SSS  and  pore  velo¬ 
cities  decreased  below  ambient  rates.  The  slower  pore  velocities  produced 
effective  retardations  which  ranged  from  5.5  to  6.8  for  residual  SNAPL 
saturation  less  than  13.9  percent.  For  a  SSS  constructed  with  a  soil 
having  a  permeability  10  times  greater  than  the  SA,  streamlines  converged 
near  the  sorption  field,  indicating  an  acceleration  of  flow  toward  the  SSS 
{this  is  the  case  depicted  in  Fig.  10).  The  accelerated  flow  gave  an 
effective  retardation  that  was  less  than  one,  meaning  nonreactive  pollu¬ 
tants  moved  faster  after  SSS  installation. 

The  effective  retardation  curves  for  1-methylnaph*'  lene  illustrate 
a  potentially  significant  contaminant  velocity  reduction  through  the  SSS. 
These  curves  were  obtained  in  the  same  manner  as  described  above,  except 
that  Kp  equaled  an  experimental  value  of  11100  1/kg  and  equaled  7413 
1/kg  for  1-methylnaphthalene  (Sangster  1989).  The  fraction  of  natural 
organic  matter  in  the  soil  was  assumed  to  be  0.02  percent  for  the  SSS  and 
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the  surrounding  aquifer.  For  a  given  value  of  z,  the  reactive  contaminant 
curves  originated  from  the  same  starting  points  as  the  nonreactive  curves; 
however,  1 -methyl naphthalene  sorption  increased  rapidly  with  residual 
SNARL  saturation.  The  increase  in  sorption  caused  reactive  effective 
retardation  curves  to  diverge  rapidly  from  that  of  nonreactive  consti¬ 
tuents.  In  the  range  of  observed  decane/mineral  oil  residuals,  sorption 
alone  could  potentially  increase  retardation  of  methyl  naphthalene  two 
orders  of  magnitude  above  a  nonreactive  constituent  under  the  same  flow 
conditions.  Estimated  effective  retardations  ranged  as  high  as  1630  when 
z  equaled  0.1  and  as  low  as  120  when  Z  equaled  10.0.  For  the  latter  case 
of  Z  equal  to  10,  the  velocity  of  nonreactive  constituents  would  increase 
as  groundwater  flow  converges  upon  the  SSS;  nevertheless,  the  migration 
rate  of  1-methylnaphthalene  would  decrease  by  a  factor  of  120  because  of 
the  SSS. 

A  first  order  approximation  of  the  SSS  capacity  to  retain  con¬ 
taminants  may  be  estimated  in  terms  of  the  system's  effective  retardation. 
This  translates  ambient  contaminant  velocities  to  what  they  will  be  once 
the  sorption  system  is  installed.  An  estimate  of  the  time  needed  to 
exhaust  a  SSS  can  be  obtained  from 
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0.5 


•^e  •^e  I  SSS 
^co 


(56) 


Tg  5  is  the  time  needed  to  achieve  at  the  down  gradient  edge  of  the  sorp¬ 
tion  field,  a  contaminant  concentration  which  is  one  half  of  a  constant 
resident  concentration  imposed  at  the  entrance  of  the  SSS.  is  an 
equivalent  travel  length  through  the  SSS.  Sufficient  radius  would  be 
needed  to  ensure  a  contaminant  plume  was  intercepted. 

In  the  past,  engineers  may  have  been  tempted  to  avoid  the  complexity 
of  the  above  analysis  and  estimate  effective  retardation  by  taking  the 
ratio  of  the  retardation  factor  inside  the  system  ^o  that  for  ambient 
conditions  This  approach  ignores  the  retardation  caused  by  spatial 
changes  in  permeability;  hence,  ignoring  the  changes  in  the  velocity  field 
that  result  after  the  SSS  is  installed.  The  retardation  factor  for  the 
ambient  condition  is  found  by  substituting  Eq.  43  into  Eq.  42.  R^^^  is 
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obtained  after  Eqs.  43,  44,  45,  and  46  are  combined  with  Eq.  42.  The 
ratio  of  the  retardation  factors  is  plotted  in  Fig.  10.  It  now  becomes 
evident,  that  the  ratio  of  retardation  factors  is  an  inappropriate  substi¬ 
tute  for  Eq.  55,  unless  the  permeability  and  water  filled  porosity  of  the 
soil  in  the  SSS  is  close  to  that  of  the  surrounding  media. 

In  the  domain  outside  the  boundary  of  the  sorption  system  the  effec¬ 
tive  retardation  is  obtained  from  the  resultant  specific  discharge  for 
flow  in  this  region.  The  specific  discharge  components  of  q  in  the  SA  are 
found  from  complex  differentiation  of  Eq.  51b. 
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The  real  portion  of  Eq.  57  equals  the  specific  discharge  component  along 
the  X  axis;  thus. 
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The  imaginary  part  gives 
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The  resultant  specific  discharge  q  in  the  SA  is  then 
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Eq.  60  is  valid  for  any  combination  of  x  and  y  where  (x  ^  ^  y^)  >R^. 

In  contrast  to  the  region  inside  the  sorption  system,  the  specific  dis¬ 
charge  is  not  uniform  throughout  the  domain  of  the  SA.  It  is  evident, 
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however,  that  at  large  distances  from  the  sorption  system,  the  specific 
discharge  approaches  that  of  the  uniform  flow  specified  in  the  far  field. 

An  analytical  expression  for  the  effective  retardation  in  the  SA  is 
found  from  combining  Eqs.  47,  48,  49,  and  60;  the  expression  is  simply  the 
reciprocal  of  the  resultant  specific  discharge  in  the  SA  normalized  to  the 
uniform  flow  rate  from  the  ambient  condition 
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Note  that  the  permeability  of  the  sorption  system  has  been  substituted  for 
the  product  of  and  k^. 

A  family  of  hypothetical  effective  retardation  curves  were  developed 
for  the  point  location  created  by  the  intersection  of  the  upgradient 
boundary  of  the  SSS  and  the  x  axis  (see  Fig.  12).  Eq.  54  was  substituted 
into  Eq.  61  and  solved  for  a  broad  range  of  residual  SNARL  saturations. 
The  general  retardation  effects  of  the  permeability  ratio  z,  are  the  same 
as  in  the  sorption  zone.  After  deriving  the  effective  retardation 
function  for  the  SA,  it  becomes  obvious  that  changes  in  apparent 
contaminant  velocity  are  due  entirely  to  the  deviations  from  the  ambient 
uniform  flow  field.  These  deviations  are  the  result  of  permeability 
differences  between  the  SSS  and  the  SA  alone.  At  locations  further  away 
from  the  SSS  the  hydraulic  disturbances  created  by  the  system  decrease  and 
the  effective  retardation  approaches  1.0.  The  curves  in  Fig.  12  apply  to 
both  reactive  and  nonreactive  contaminants.  The  sorptive  characteristics 
of  the  porous  media  are  not  altered  in  the  SA;  consequently,  sorption 
should  not  affect  estimates  of  effective  retardation. 
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EFFECTIVE  RETARDATION 
IN  SURROUNDING  AQUIFER 


EFFECTIVE  RETARDATION 


12  Effective  Retardation  in  the  Aquifer  Surrounding 
Hypothetical  Cylindrical  SSS  (Z  =  . 


5.0  CONCLUSIONS  AND  RECOMMENDATIONS 

Experimental  and  theoretical  research  has  been  conducted  on  sub¬ 
surface  contaminant  sorption  systems.  The  goal  of  the  project  was  to 
develop  a  knowledge  base  that  would  be  useful  in  designing  sorption  sys¬ 
tems  and  predicting  system  performance  in  the  field.  In  an  effort  to 
achieve  this  goal,  the  first  research  objective  was  to  perform  experiments 
with  specific  organics,  that  would  deliver  data  needed  to  predict  sorption 
system  capacity  and  performance  for  those  organics.  The  second  objective 
was  to  provide  a  theoretical  foundations  leading  to  the  development  of 
analytical  expressions  which  predict  the  performance  of  subsurface  sorp¬ 
tion  systems  with  cylindrical  geometry. 

With  regard  to  the  first  objective,  experiments  were  successfully 
executed.  Measured  BTCs  were  clearly  simulated  using  available  nonequi¬ 
librium  transport  models.  Basic  parameters  such  as  the  retardation  factor 
were  accurately  and  independently  estimated  without  the  use  of  miscible 
displacement  experiments. 

With  regard  to  the  second  objective,  a  theoretical  analysis  of  a 
general  cylindrical  SSS  produced  a  new  dimensionless  design  parameter,  the 
'Effective  Retardation'.  The  new  parameter  represents  a  concise  expres¬ 
sion  of  system  performance  in  terms  of  residual  SNARL  saturations,  salient 
sorption  parameters,  and  appurtenant  permeabilities.  An  application, 
using  experimental  da^a  on  1-methylnaphthalene,  showed  that  a  SSS  could 
conceivably  decrease  plume  velocities  two  to  three  orders-of-magnitude. 
Under  typical  hydraulic  gradients,  plumes  could  be  retarded  for  3  to  30 
years . 

With  regard  to  future  research,  three  major  recommendations  are 
forwarded 

1)  Independent  methods  of  estimating  nonequilibrium  model  parameters 
should  be  a  research  priority;  but,  only  if  subsurface  sorption  sys¬ 
tems  are  likely  to  be  used  under  induced  groundwater  flow  condi¬ 
tions.  Otherwise,  natural  gradients  are  not  expected  to  produce  t  .e 
level  of  nonequilibrium  that  could  result  in  premature  contaminant 
breakthrough. 
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2)  This  research  effort  derived  expressions  for  effective  retardation 
in  cylindrical  SSSs.  Theoretical  work  is  needed  to  develop  effec¬ 
tive  retardation  expressions  for  other  practical  geometric  configu¬ 
rations  (i.e.,  rectangle,  ellipse,  crescent). 

3)  Laboratory  and  theoretical  tools  are  now  available  to  design  and 
predict  the  performance  of  a  cylindrical  system.  It  is  now  appro¬ 
priate  to  construct  and  test  a  system  on  an  intermediate  scale. 
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ABSTRACT 

The  major  objectives  of  this  research  initiation  project  were  to  demonstrate  that  the  partitioning  of 
polyaromatic  hydrocarbons  (PAHs)  to  soils  can  be  enhanced  by  surfactant  sorption,  and  that  PAH 
affinity  to  soil  depends  on  both  the  amount  and  structure  of  sorbed  surfactant.  The  affinity  of 
naphthalene  for  cetyl  trimethyl  ammonium  bromide  (CTAB)-coated  silica  was  investigated  and 
found  to  depend  on  the  surface  concentration  of  CTAB.  A  model  of  structure  of  the  CTAB-surface 
coating  as  a  function  of  coverage  is  reported  that  accounts  for  the  naphthalene  partitioning 
behavior.  At  the  lower  coverages,  CTAB  is  sorbed  primarily  as  hemimicelles,  but  as  the  amount 
of  CTAB  on  the  surface  increases,  surface  micelles  begin  to  form  in  increasing  number.  Since  the 
surface  micelles  form  a  more  hydrophobic  structure,  naphthalene  molecules  have  a  greater  affinity 
for  them  compared  to  the  hemimicelles.  At  high  enough  coverages,  surfactant  coatings  are  formed 
which  have  a  greater  affinity  for  hydrophobic  contaminants  than  natural  organic  matter.  The 
results  of  this  study  indicate  that  the  structure  and  orientation  of  surfactant  coatings  can  have  a 
significant  impact  on  the  affinity  of  hydrophobic  organic  contaminants  for  mineral  surfaces. 
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I.  INTRODUCTION 


Containment  and  cleanup  of  groundwater  contaminated  by  jet  fuel  spills  at  US  Air  Force  bases  is 
an  area  of  growing  concern.  One  way  to  control  pollutant  migration  or  to  help  remove 
contaminants  from  spill  sites  is  to  introduce  surfactants  (surface  active  agents)  into  groundwater 
aquifers.  These  compounds  can  change  the  chemical  properties  of  the  interfacial  regions  between 
the  fuel,  water,  and  aquifer  solids  and  change  the  mobility  of  both  water-soluble  and  insoluble 
components  of  jet  fuel.  A  better  understanding  of  the  chemical  interaction  of  surfactants  with 
aquifer  material  is  required  before  surfactants  can  be  most  effectively  used  to  contain  contaminant 
plumes  or  clean  up  contaminated  sites. 

II.  STATEMENT  OF  THE  PROBLEM 

The  potential  for  medium-weight  (2-  and  3-ring  compounds)  polyaromatic  hydrocarbons  (PAHs) 
to  contaminate  groundwater  in  soils  with  low  organic  carbon  and  low  clay  content  is  of  primary 
concern  to  the  Air  Force  and  has  been  identified  as  such  in  a  recent  Broad  Agency  Announcement 
(BAA  90-002).  Soil  venting  schemes  currently  being  applied  by  the  Air  Force  to  JP-4  fuel  spills 
result  in  a  low  volatility  residue  containing  medium-weight  PAHs  which  have  moderate  solubility. 
At  JP-4  spill  sites  there  is  concern  that  these  soluble  organic  contaminants  will  move  rapidly  out  of 
biologically  active  zones  and  into  the  groundwater.  One  possible  method  to  reduce  the  transport  of 
these  types  of  contaminants  is  to  add  surfactants  to  increase  the  organic  carbon  content  of  aquifer 
materials,  thereby  increasing  sorption  and  reducing  mobility.  The  Air  Force  is  interested  in 
evaluating  the  potential  for  surfactants  to  reduce  PAH  mobility  at  JP-4  spill  sites. 

III.  OBJECTIVES 

The  major  objectives  of  this  mini-grant  work  were  to  demonstrate  that  the  partitioning  of  PAHs  to 
soils  can  be  enhanced  by  surfactant  sorption  and  that  PAH  affinity  to  soil  depends  on  both  the 
amount  and  structure  of  sorbed  surfactant  Two  types  of  measurements  were  made  on  a 
surfactant/mineral  system:  (1)  surfactant  and  PAH  sorption  affinity  measurements  as  a  function  of 
pH  to  find  optimal  conditions  '"or  PAH  sorption,  and  (2)  interfacial  potential  measurements  to 
determine  how  the  structure  oi  sorbed  surfactant  affects  subsequent  PAH  sorption. 
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IV.  BACKGROUND 


When  a  fuel  product  is  accidentally  introduced  into  the  soil  environment,  its  fate  and  transpon 
depend  on  the  physical  and  chemical  properties  of  the  soil  and  the  chemical  characteristics  of  the 
fuel.  A  particularly  important  property  of  a  soil  matrix  in  this  context  is  its  organic  carbon  content 

The  capacity  of  a  soil  to  sorb  nonionic,  hydrophobic,  organic  compounds  is  related  to  the  organic 
carbon  content  of  the  soil  (Chiou  et  al.,  1979, 1983).  Cationic  surfactants  can  be  added  to  clays  to 
increase  their  sorption  capacity,  leading  to  enhanced  sorption  of  low  molecular  weight  hydrophobic 
compounds  (Bouchard  et  al.,  1988;  Boyd  et  al.,  1988).  Some  recent  work  has  been  conducted 
illustrating  that  adsorption  of  the  nonionic  surfactant  polyethylene  glycol  can  lead  to  enhanced  PAH 
sorption  onto  aquifer  solids  (Podoll  et  al.,  1988). 

While  these  studies  illustrate  the  potential  for  surfactants  to  affect  hydrophobic  contaminant 
sorption,  none  have  attempted  to  relate  the  structure  of  the  sorbed  surfactant  to  contaminant 
affinity.  In  monitoring  surfactant  sorption,  the  most  significant  advances  in  understanding 
sorption  behavior  have  been  made  by  combining  information  from  several  types  of  measurements. 
In  past  studies,  adsorption  isotherms,  electroldnetic  data  and  contact  angle  measurements  have 
yielded  the  greatest  insights  into  structure  of  sorbed  surfactants  (Ginn,  1970;  Fuerstenau,  1957; 
Fuerstenau  et  al.,  1964;  Somasundaran  and  Fuerstenau,  1966).  Recently,  fluorescence 
spectroscopic  studies  (Parcher  et  al.,  1989)  have  been  performed  to  clarify  the  role  of  an  organic 
coating's  structure  in  the  sorption  affinity  of  hydrophobic  contaminants  to  organic  coatings.  Using 

a  fluorescing  hydrophobic  probe  molecule,  Parcher  et  al.  (1989)  found  that  silica  coated  with  a 
short  chain  (C2)  hydrocarbon  and  having  a  relatively  lower  organic  carbon  content  (5.6%)  was 

more  effective  at  removing  a  hydrophobic  solute  than  silica  coated  with  longer  chain  (Cg  and  C^g ) 
hydrocarbons  and  having  higher  organic  content  (14%  and  22%).  Although,  conventional  wisdom 
suggests  that  the  surface  coated  with  a  longer  chain  hydrocarbon  and  having  a  higher  organic 
content  would  be  more  hydrophobic  and  should  have  a  higher  affinity  for  a  sorbing  hydrophobic 
compound,  this  was  contrary  to  what  was  observed.  Parcher  et  al.  (1989)  found  that  the  structure 
of  the  organic  moiety  at  the  surface  was  responsible  for  the  observed  partitioning  behavior,  not  the 
chain  length  of  the  organic  coating  or  its  percent  by  weight  of  the  solid.  Only  by  using  methods 
that  can  identify  the  structure  and  orientation  of  surfactant  molecules  in  the  interfacial  region  can  it 
be  learned  what  types  of  surface  organic  coatings  have  the  greatest  affinity  for  various  classes  of 
hydrophobic  contaminants. 

A  technique  which  has  been  used  in  the  past  and  is  used  in  this  study  to  characterize  the  structure 
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of  sorbed  surfactants  is  the  measurement  of  electrophoretic  mobility  as  a  function  of  surfactant 
sorption  density.  In  particular,  shifts  in  the  magnitude  and  sign  of  the  electrophoretic  mobility  as  a 
function  of  surfactant  coverages  can  be  used  to  infer  the  structures  of  sorbed  complexes 
(Somasundaran  and  Fuerstenau,  1966;  Shergold,  1986;  Yap  et  al.,  1981).  The  approach  is 
illustrated  using  electrophoretic  mobility  and  CTAB  (cetyl  trimethyl  ammonium  bromide)  sorption 
density  on  silica  from  data  of  Hayes  (1991)  (Figure  1). 


B 


•o 

< 


CTAB  concentration  in  solution  (M) 


Figure  1.  Adsorption  density  and  dynamic  electrophoretic  mobility  for  CTAB  on 
0.5  -  10  |im  silica  at  O.OIM  NaCTl  and  pH  8. 


The  CTAB  sorption  isotherm  can  be  divided  into  the  four  regions  shown  in  Figure  1.  In  region  I, 
isolated  CTAB  molecules  are  thought  to  be  sorbed  by  an  electrostatic  attraction  between  negatively 
charged  surface  sites  and  the  positively  charged  surfactant.  In  this  region,  the  silica  surface  is 
hydrophilic  since  it  is  still  composed  of  nrostly  deprotonated  surface  hydroxl  groups  as  indicated 
by  the  negative  value  of  the  electrophoretic  mobility.  The  marked  increase  in  sorption  in  region  n 
is  thought  to  be  mainly  due  to  hemimicelle  (cluster  of  surfactant  molecules  on  the  surface) 
formation,  a  result  of  lateral  interactions  between  the  hydrocarbon  chains.  The  electrophoretic 
mobility  is  negative  in  the  absence  of  CTAB,  beet  .nes  less  negative  with  the  addition  of  (TTAB, 
and  finally  changes  sign.  As  the  CTAB  concentration  increases  further,  the  slope  in  the  isotherm 
decreases  (region  HI)  reflecting  a  lower  affinity  of  a  positively  charged  surfactant  for  a  positive 
surface.  The  increase  in  sorption  in  this  region  is  thought  to  be  mainly  due  to  the  favorable 
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hydrophobic  interactions  between  the  hydrocarbon  portions  of  the  surfactant  molecules.  This 
hydrophobic  interaction  may  also  lead  to  bilayer  formation,  which  is  thought  to  be  negligible  in 
region  II,  but  substantial  in  region  ID.  Finally,  the  sorption  isotherm  reaches  a  plateau  (region  IV) 
near  the  critical  micelle  concentration  (CMC)  of  CTAB  (9.0  x  10^  M).  The  sorption  density  and 
the  electrophoretic  mobility  remain  constant  at  this  point  because  the  formation  of  solution  phase 
micelles  is  energetically  more  favorable  than  additional  sorption.  Figure  2  schematically  illustrates 
this  interpretation  of  the  stmcture  and  configuration  of  CTAB  molecules  on  silica  as  a  function  of 
surface  coverage. 


region  I  region  II  region  III 

Figure  2.  Structures  of  the  surfactant  adsorption  isotherm. 
Stratton-Crawley  and  Shergold,  1981.) 


region  IV 
(After 


In  the  present  work  naphthalene,  an  organic  contaminant,  is  sorbed  to  CTAB-coated  silica  to 
investigate  the  effects  of  the  orientation  of  surfactant  molecules  on  their  affinity  for  naphthalene. 
Previous  solution  phase  work  (Edwards  et  al.,  1991)  has  demonstrated  that  the  aqueous  phase 
solubility  of  naphthalene  is  significantly  enhanced  at  surfactant  concentrations  above  the  CMC.  It 
is  possible  that  the  surface  equivalent  of  a  micelle  in  solution  may  also  play  a  role  in  enhancing 
sorption  of  polyaromatic  hydrophobic  organic  contaminants.  As  mentioned  above,  one  of  the 
goals  of  this  study  was  to  determine  if  a  particular  structure  of  sorbed  surfactant  would 
substantially  enhance  naphthalene  partitioning. 


V.  EXPERIMENTAL  PROTOCOL 
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Silica.  The  silica  used  in  these  experiments  was  #40  Sil-co-sil  ground  silica  obtained  from  the 
U.S.  Silica  Company.  This  silica  was  from  their  Ottawa,  IL  mine,  and  was  used  without  funher 
treatment.  Using  BET  adsorption,  this  silica  was  found  to  have  a  specific  surface  area  of  5.6 

ni2/g. 

Surfactant  Adsorption  Measurements.  Batch  sorption  experiments  were  conducted  using 
CTAB.  The  quantity  sorbed  was  determined  by  the  loss  of  solute  from  the  liquid  phase.  The 
reactors  used  were  4  ml  (nominal)  glass  vials  with  Teflon-faced  rubber  septa.  Water,  silica, 
surfactant  (and  its  radiolabelled  tracer),  background  electrolyte  (NaCl),  and  base  (NaOH)  were 
added  in  appropriate  amounts  to  the  vials.  The  ionic  strength  was  kept  constant  at  0.01  M. 
Experiments  were  conducted  at  three  different  surfactant  concentrations,  0.0583, 0.583,  and  1.67 
mM  and  at  pH  values  of  8. 1  and  9.8.  Mixing  of  the  vials  was  accomplished  by  end-over-end 
rotation  of  the  reaction  vials  for  a  period  of  16  -  24  hours  at  8  rpm. 

Solid/liquid  separation  was  accomplished  by  centrifuging  for  two  hours  at  4,000  rpm  at  20  on  a 
Sorvall  RC-5B  refrigerated  centrifuge  with  a  GSA  rotor  (maximum  relative  centrifugal  force  = 
2,604).  For  each  samrle,  the  pH  was  measured  using  a  Ross  combination  pH  electrode  and  an 
aliquot  of  supernatant  .s  removed  to  measure  *"^0  activity  on  an  LKB  Wallac  1219  Rackbeta 
liquid  scintillation  counter.  In  order  to  insure  low  surfactant  solution  concentration  (well  below  the 
CMC)  in  the  aqueous  phase,  most  of  the  remaining  supernatant  was  removed  and  replaced  with  an 
equivalent  volume  of  surfactant-free  "wash"  water  at  O.OIM  NaQ  and  the  appropriate  pH.  The 
washing  solution  pH  was  adjusted  to  the  pH  measured  prior  to  solid/liquid  separation,  and  this 
was  taken  as  the  pH  of  the  sample.  The  final  surface  concentration  was  determined  by  accounting 
for  any  additional  desorption  which  occurred  following  this  washing. 

Naphthalene  Sorption  Measurements.  Separate  naphthalene  sorption  experiments  were 
conducted  without  *'^C  labelled  CTAB  by  replacing  the  washing  solution  described  above  with  a 
solution  of  naphthalene  having  the  desired  pH,  naphthalene  concentration  and  an  ionic  strength  of 
O.OIM  (NaCl).  The  vials  were  filled  to  eliminate  head  space  and  then  capped  with  Teflon  lined 
rubber  septa.  The  solids  were  resuspended  by  vortex  mixing  and  then  put  on  a  rotator  for  12  - 18 
hours  to  allow  equilibration.  Then  the  vials  were  vortexed  again  (mainly  to  make  sure  there  was 
no  solid  sticking  to  the  caps)  and  centrifuged  as  in  the  CTAB  sorption  experiments. 

Naphthalene  was  measured  by  injection  of  the  supernatant  into  a  Waters  HPLC  using  a  70/30 
methanol/water  mobile  phase  and  a  Supelco  15  cm  Econosphere  5jim  C18H38  column.  The 


apparatus  had  a  Lambda-Max  Model  48  LC  Spectrophotometer  and  an  M-45  Solvent  Delivery 
System  operated  at  1.0  ml/min. 


Dynamic  Electrophoretic  Mobility  Measurements.  Dynamic  electrophoretic  mobility  was 
measured  using  an  electrokinetic  sonic  amplitude  (ESA)  measurement  apparatus.  In  order  to  make 
ESA  measurements,  an  alternating  current  is  applied  to  a  suspension  of  particles,  causing  these 
particles  to  undergo  oscillatory  motions  due  to  the  oscillating  electrical  forces  on  the  charged 
particles.  An  acoustic  wave  develops  due  to  the  difference  in  density  between  the  particles  and  the 
liquid.  Pulsed  ultrasonic  methods  with  phase  sensitive  detection  are  used  to  measure  the 
magnitude  and  phase  angle  of  the  electro-acoustic  signal  generated.  The  dynamic  electrophoretic 
mobility  is  calculated  from  ESA  measurements  according  to  the  theory  described  by  O’Brien 
(1988).  The  instrument  used  for  these  measurements  was  a  Matec  ESA  8000  Measurement 
System. 

VI.  RESULTS 


Figures  3  and  4  show  isotherms  for  naphthalene  sorbed  to  silica  ■  .  both  the  presence  and  absence 
of  a  CTAB  coating  at  two  different  pH  values.  The  lines  fitted  to  the  data  are  used  to  calculate 
values  of  tlie  naphthalene  partition  coefficient  (K<i)  for  each  coverage  and  pH  combination 
measured.  The  partition  coefficient,  defined  by 


where  qe,  Q  are  the  amount  of  naphthalene  sorbed  and  equ.  brium  solution  concentration, 
respectively,  is  the  slope  of  each  line  in  Figures  3  and  4.  The  values  of  the  partition  coefficients 
estimated  for  each  coverage  and  pH  value  are  presented  in  Table  I.  These  K<i  values  can  also  be 
normalized  for  the  fraction  of  organic  carbon  (foe)*  or  for  tlie  fraction  of  the  surface  covered  (f^^) 

by  the  CTAB  (assuming  26  per  CTAB  molecule  (Stratton-Crawley  and  Shergold,  1981)  and 
monolayer  coverage)  to  give  a  sorption  coefficient  which  is  typically  referred  to  as  the  Kqc  value, 

e.g., 


Kd 


oc 


where  foe  =  (mass  of  surfactant  sorbed  per  mass  of  silica)  or 

Kd 


Kqc 


‘sc 


(2) 

(3) 
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where 


=  (m^  of  surfactant  sorbed  per  of  silica) 


Table  I.  CTAB  surface  coverages  and  K<i  values. 


CTAB  surface 
coverage 

Kd 

pH 

(mol/m^) 

(I7m2) 

(cm^/g) 

1.49  X  10-7 

5.53  X  10-6 

1,311 

8.1 

1.44  X  10-6 

4.15  X  10-4 

1,985 

8.1 

3.26  X  10-6 

2.34  X  10-3 

2,976 

8.1 

1.81  X  10-7 

5.42  X  10-5 

163 

9.8 

1.66  X  10-6 

7.52  X  10-4 

1,262 

9.8 

3.44  X  10-6 

2.34  X  10-3 

3,151 

9.8 

Values  of  K<i  normalized  by  fQ^.  and  fg^  plotted  versus  fQ^  and  fg^  are  presented  as  Figures  5  and 

6,  respectively.  If  the  amount  of  organic  carbon  is  the  most  important  parameter  in  determining 
naphthalene  partitioning,  then  values  of  Kq^  would  be  expected  to  be  independent  of  the  surface 

coverage,  e.g,  Kq^  would  be  constant.  If,  however,  the  structure  and  orientation  of  a  surfactant  is 

important  in  naphthalene  partitioning,  then  the  normalized  values  of  K<i  would  be  expected  to 

change  as  a  function  of  surface  structure  for  naphthalene.  Based  on  the  results  in  Figures  5  and  6, 
which  show  that  the  value  of  Kq^  increases  with  surfactant  coverage,  it  is  clear  that  the  partitioning 

of  naphthalene  becomes  more  favorable  at  higher  surfactant  coverages.  The  values  of  Kqq  in  units 
of  cm^/g  as  a  lu  .of  surfactant  are  also  given  in  Table  I. 


To  funher  clarify  the  role  of  structure  and  orientation  of  the  surfactant  on  naphthalene  partitioning 
as  a  function  of  surfactant  surface  coverage,  the  electrophoretic  mobility  of  silica-coated  particles 
was  estimated  by  making  ESA  measurements.  The  results  of  the  dynamic  electrophoretic  mobility 
are  shown  in  Figure  7.  As  discussed  above,  the  electrophoretic  mobility  measurements,  in 
combination  with  the  sorption  isotherms  data,  can  be  used  to  infer  the  structure  and  orientation  of 
so’^bed  CTAB  (Figures  1  and  2).  This  information  can  then  be  used,  in  turn,  to  explain  the 
structural  basis  for  the  enhanced  affinity  of  naphthalene  for  CTAB-coated  silica  at  high  CTAB 
coverages. 
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CTAB  coverage: 

•  high 
O  medium 
A  low 
D  silica 

pH  =  9.8 
NaCl  =  0.01M 


0.00  0.02  0.04  0.06  0.08  0.10 

naphthalene  concentration  in  solution  (mM) 


Figure  3.  Naphthalene  isotherms  on  CTAB-coated  silica  at  pH  =  9.8. 


CTAB  coverage: 

•  high 
O  medium 
A  low 
D  silica 

pH  =  8.1 
NaCl  =  0.01M 


naphthalene  concentration  in  solution  (mM) 


Figure  4.  Naphthalene  isotherms  on  CTAB-coated  silica  at  pH  =  8.1. 
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Figure  5.  Naphthalene  sorption  to  silica  plotted  versus  ^oc- 


Figure  6.  Naphthalene  sorption  to  silica  plotted  versus  ^sc  (assuming  monolayer  coverage  and 
26  A2/molecjle  (Stratton-Crawley  and  Shergold,  1981)). 
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Figure  7.  Dynamic  electrophoretic  mobility  as  a  function  of  the  fraction  of  the  silica 
surface  covered  by  CTAB  (assuming  monolayer  coverage  and  26  A^/molecule). 

VH.  DISCUSSION 

As  shown  in  Figures  3  and  4,  the  naphthalene  partitioning  to  uncoated  silica  particles  is  negligibly 
small.  This  is  expected,  since  the  surface  of  uncoated  silica  in  water  is  essentially  hydrophilic  and 
should  have  little  affinity  for  a  hydrophobic  solute  like  naphthalene,  while  the  surfactant  coating 
renders  the  surface  more  hydrophobic.  As  also  shown,  naphthalene  sorption  increases  with 
increasing  concentration  of  sorbed  surfactant.  This,  too,  is  expected,  because  as  the  amount  of 
CTAB  on  the  surface  increases,  there  is  an  increasing  amount  of  an  organic  carbon  phase  on  the 
surface  to  which  naphthalene  can  partition.  When  the  partition  coefficient  of  naphthalene  sorbed 
per  mass  of  organic  carbon,  Kq^^,  is  plotted  versus  surface  coverage,  it  is  found  that  Kq^  also 
increases  with  coverage  (Figures  5  and  6).  In  general,  increasing  values  of  Kq^  with  coverage  are 

not  expected,  unless,  as  a  function  of  surfactant  coverage,  the  surface  is  becoming  increasingly 
more  hydrophobic  per  unit  mass  of  organic  carbon.  Thus,  the  increase  in  observed  with 

increasing  surfactant  coverage  must  be  related  to  changes  in  the  structure  and  orientation  of  the 
surfactant  molecules  on  the  silica  surface  as  the  surface  coverage  increases. 

A  plausible  explanation  of  the  increase  in  the  value  of  Kq^  with  coverage  can  be  made  based  on  the 

description  of  the  structure  and  orientation  changes  expected  for  sorbed  CTAB  as  a  function  of 
coverage  (Figures  1  and  2).  Assuming  that  the  low  CTAB  sorption  coverages  reported  here  are 
representative  of  hemimicelle  formation  of  region  II,  and  that  the  higher  coverages  are 
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representative  of  bilayer  formation  of  region  HI,  a  plausible  explanation  of  the  increase  of  Kq^  can 

be  made-  When  naphthalene  sorbs  to  a  hemimicelle-like  coating  (region  II),  it  is  less  effectively 
sheltered  from  the  water  phase  and  some  of  naphthalene  molecule  may  still  be  exposed  to  the 
aqueous  phase.  At  higher  coverages,  however,  naphthalene  is  sorbed  in  the  midst  of  the  bilayer 
(region  HI).  Since  the  bilayer  region  likely  provides  a  more  hydrophobic  environment  and  may 

more  completely  shelter  naphthalene  from  the  water  phase,  it  has  a  greater  affinity  for  naphthalene, 
and  hence  it  is  expected  that  Kq^  would  increase  as  the  relative  concentration  of  the  bilayer  or 

surface  "micelles"  compared  to  the  surface  hemimicelles  increases  with  coverage.  That  the 
organic-noimalized  partition  coefficient,  Kq^.,  increases  monotonically  with  surfactant  coverage 

indicates  that  the  formation  of  surface  micelles  may  be  a  continuous  process,  in  contrast  to  the 
critical  micelle  concentration  that  occurs  in  the  aqueous  phase.  However,  more  work  is  needed 
over  a  wider  range  of  surfactant  coverages  and  pH  conditions  to  verify  this. 

Additional  support  for  the  above  proposed  structural  model  for  surfactant  sorption  and  naphthalene 
partitioning  as  a  function  of  surfactant  surface  coverage,  has  been  obtained  from  electrophoretic 
mobility  measurements  (Figure  7).  As  described  above,  surfactant  sorption  data,  used  in 
conjunction  with  electrophoretic  mobility  data,  can  be  used  to  identify  the  structure  and  orientation 
of  surfactants  as  a  function  of  coverage  (Figures  1  and  2).  In  particular,  the  transition  from  the 
hemimicelle  to  bilayer  structure,  in  going  from  regions  II  and  III,  is  thought  to  occur  near  the  point 

of  the  reversal  of  the  sign  of  the  electrophoretic  mobility  as  a  function  of  surfactant  coverage.  For 
the  conditions  reported  here,  the  electrophoretic  imbility  sign  reversal  occurs  between  the  low  (f^^, 

=  0.02)  and  medium  (f^^  =  0.20)  coverage  ranges  studied  (Figure  7).  This  supports  the 

conclusion  that  the  sorption  of  naphthalene  is  occurring  to  CJTAB-coated  silica  represented  by 
regions  11  and  III.  Hence,  this  data  provides  further  support  for  the  contention  that  the  surface  is 
becoming  increasingly  more  hydrophobic  for  naphthalene  partitioning  with  coverage,  with  the 
transition  frx)m  region  11  to  region  III  occurring  near  the  sign  reversal  of  the  dynamic 
electrophoretic  mobility. 

The  effects  of  pH  on  the  naphthalene  panitioning  also  provide  further  support  for  the  proposed 

model  of  a  surface  which  becomes  increasingly  more  hydrophobic  with  increasing  surfactant 
coverage.  As  shown  in  Figures  5  and  6,  Kq^  for  naphthalene  increases  nwnotonically  at  both  the 

pH=  8. 1  and  pH  =  9.8  samples.  It  is  interesting  to  note  that  the  slope  of  the  Kq^.  versus  foe  is 

greater  for  the  pH  8. 1  compared  to  pH  =  9.8  samples.  A  possible  reason  for  this  is  that  transition 
from  hemimicelle  to  bilayer  formation  occurs  at  lower  surfactant  coverages  at  a  pH  value  of  8. 1 
compared  to  pH  9.8  samples.  Since,  at  the  lower  pH,  uncoated  silica  has  fewer  negative  sites  to 
sorb  surfactants  in  region  I,  it  is  possible  that  there  are  correspondingly  fewer  sites  frt>m  which 
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hemimicelle  and  bilayers  will  form  as  coverage  increases  at  lower  pH.  Hence,  the  transition  from 

region  II  to  III  would  be  expected  to  occur  at  lower  coverages  and  to  give  steeper  slope  values  in  a 
plot  of  versus  foe  at  lower  pH.  Although  a  steeper  slope  is  observed,  the  transition,  based  on 

the  zero  point  of  the  electrophoretic  mobility,  does  not  appear  to  occtir  at  lower  coverages  for  the 
pH  8.1  compared  to  9.8  systems  (Figure  7).  More  work  is  needed  on  the  effects  of  surfactant 
coverage,  electrophoretic  mobility,  and  naphthalene  partitioning  as  a  function  of  pH  to  verify  this 
hypothesis. 

Finally,  a  comparison  of  the  values  of  Kq^  for  naphthalene  sorption  to  surfactant  coatings 
compared  to  natural  soil  organic  coatings  is  instructive.  An  estimate  of  the  value  for  soil 
organic  carbon  can  be  obtained  from  correlations  that  have  been  established  between  LogKQ^^  and 
LogK^^,  the  Log  of  the  octanol/water  partition  coefficient  (Karickhoff,  1984;  Curtis  et  al.,  1986). 

For  example,  Karickhoff  has  found  the  that  relationship 

LogKoc  = 

describes  the  partitioning  behavior  of  hydrophobic  contaminants  to  natural  organic  matter  quite 
well.  Using  the  relationship  of  Eq.  4,  and  a  value  of  LoglC^^y  1971),  the 

predicted  value  of  Kqq  is  1 ,550  which  is  near  the  values  estimated  from  the  lower-  and 

mid-range  coverage  samples  in  this  study  (Table  I).  Hence,  it  can  be  seen  that  surfactant  coatings 
on  minerals  can  be  generated  which  are  even  more  effective  than  natural  soil  organic  matter.  This 
is  not  too  surprising  in  view  of  the  fact  that  surface  micelles  which  arc  formed  at  higher  coverage 
would  be  expected  to  be  more  hydrophobic  than  natural  humic  material  coatings  per  mass  of 
organic  carbon.  Others  have  reported  similar  enhanced  partitioning  of  hydrophobic  contaminants 
following  surfactant  modification  of  soils  (Boyd  et  al.,  1988). 

VIII.  CONCLUSION 

The  affinity  of  naphthalene  for  CTAB  coated  silica  was  found  to  depend  on  the  surface 
concentration  of  CTAB.  A  model  with  a  continuous  transition  between  two  surface  orientations 
for  the  CTAB  at  the  coverages  reported  here  accounts  for  this  behavior.  (TAB  at  the  lower 
coverages  is  sorbed  primarily  as  hemimicelles.  but  as  the  anwunt  of  CTAB  on  the  surface 
increases,  it  increasingly  tends  to  form  surface  micelles.  The  surface  micelles  form  a  more 
hydrophobic  structure,  causing  naphthalene  rrwlecules  to  have  greater  affinity  for  them  compared 
to  the  hemimicelles.  At  high  enough  coverages,  surfactant  coatings  can  be  formed  which  have  a 
greater  affinity  for  hydrophobic  contaminants  than  natural  organic  matter. 
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IX.  FUTURE  WORK 


This  study  indicates  that  the  structure  and  orientation  of  surfactant  coatings  for  mineral  surfaces  can 
have  a  significant  impact  on  the  affinity  of  hydrophobic  organic  contaminants  for  mineral  surfaces. 
There  are  many  interesting  experiments  which  naturally  follow  from  the  results  of  this 
investigation.  The  results  of  the  present  study  indicate  that  a  continuous  transition  between  a 
hemimicelle  and  micelle  surface  structure  is  occurring.  However,  the  surface  should  eventually 
become  predominantly  surface  micelles  as  coverage  increases  and  the  plot  of  Kq^.  versus  foe 

should  eventually  level  off  when  this  happens.  In  addition,  while  our  data  seem  to  indicate  that  a 
continuous  transition  from  a  surface  hemimicelle  to  micelle  state  is  occurring  as  a  function  of 
coverage,  additional  work  is  needed  to  determine  if  there  is,  in  fact,  a  critical  surface  micelle 
concentration,  the  equivalent  of  the  critical  micelle  concentration  in  solution.  These  areas  can  be 

investigated  by  performing  naphthalene  sorption  experiments  at  more  surface  coverages  to 
determine,  if  over  more  narrow  coverage  ranges,  a  change  in  the  slope  of  a  plot  of  versus  foe 

occurs.  Since  surface  charge  varies  with  pH,  the  importance  of  surface  charge  on  the  type  of 
surface  coating  formed  can  be  more  thoroughly  examined  by  expanding  the  investigation  over  a 
greater  pH  range.  In  addition,  other  surfactants  and  PAHs  should  be  studied  to  determine  if  the 
conclusions  drawn  from  this  woric  are  more  generally  applicable.  Finally,  it  is  also  important  to 
obtain  more  direct  confirmation  of  the  structure  and  orientation  of  sorbed  surfactants  as  a  function 
of  surface  coverages.  We  have  recently  found  that  by  using  self-supporting  films  and  transmission 
Fourier  Transform  Infra-Red  (FTIR)  spectroscopy  it  is  possible  to  investigate  the  structure  of 
sorbed  surfactants  at  mineral/water  interfaces.  Results  from  these  preliminary  FUR  studies  are 
consistent  with  the  description  of  the  structure  of  surfactant  surface  coatings  as  a  function  of 
coverage  as  presented  here.  These  FTIR  results  will  be  reported  in  the  near  future. 
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XII.  APPENDIX  (Notation) 


Q  solution  concentration  in  equilibrium  with  surface  (mol/L) 

C!MC  critical  micelle  concentration 

CTAB  cetyl  trimethyl  ammonium  bromide;  Ci6H33N(CH3)3Br 

ESA  electrokinetic  sonic  amplitude 

foe  fraction  of  organic  carbon  in  system  solids  (g/g) 

fsc  fraction  of  surface  coverage  of  system  solids  (m^/m^) 
g  gram 

Kqc  (L/m^)  or  (cm^/g) 

Kd  partition  coefficient,  ratio  of  surface  concentration  to  solution  concentration  (IVra^) 
L  liter 

m  meter 

ml  milliliter 

mol  mole 

PAH  polyaromatic  hydrocarbon 

Qe  surface  concentration  in  equilibrium  with  solution  (mol/m^) 
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ABSTRACT 

Removal  of  twenty  hydrocarbons  from  a  sandy  loam  soil  was  determined  over 
a  15  day  period.  Within  5  days,  all  hydrocarbons  with  boiling  points  lower  than  that 
of  undecane  were  removed  from  soil,  whether  untreated  or  treated  with  mercuric 
chloride  to  inhibit  microbial  activity.  At  10  days,  the  long  chain  alkanes  remained 
in  the  soil.  Tridecane,  tetradecane  and  pentadecane  showed  significantly  greater 
removal  in  the  untreated  than  in  the  treated  soil,  indicating  that  biodegradation 
played  a  role  in  remsdoval  of  the  higher  boiling  points  hydrocarbons. 

Microbial  abundance  and  hydrocarbon-degrading  ability  were  assessed  in  soil 
contaminated  by  jet  fuel  during  a  bioventing  project.  Soil  samples  from  a 
contaminated  site  and  an  adjacent  uncontaminated  site  were  collected  at  the  * 

beginning  and  end  of  a  9-month  project.  Total  direct  counts,  viable  counts,  and 
heterotrophic  activity  (radiolabeled  glucose  mineralization)  were  used  to  assess 
abundance  and  activity  of  the  heterotrophic  microbial  population.  Hydrocarbon- 
degrading  capacity  of  the  miaobial  population  was  assessed  by  estimating  numbers 
of  microorganisms  capable  of  utilizing  naphthalene,  hexadecane  and  toluene  (MPN  « 

method),  and  by  determining  the  mineralization  of  the  above  three  hydrocarbons  in 
short  (15  hour)  and  long-term  (21-28  days)  experiments.  Results  indicate  that 
hydrocarbon  contamination  altered  the  activity  of  the  bacterial  population,  producing 
lower  numbers  and  heterotrophic  activity  of  the  population  while  increasing  the 
hydrocarbon-degrading  capacity  of  the  population.  By  the  end  of  the  project,  the 
activity  of  the  bacterial  population  in  the  contaminated  soil  had  shifted  to  a  pattern 
characteristic  of  the  uncontaminated  site. 
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INTRODUCTION 


Previous  studies  on  the  environmental  fate  of  JP-4  have  focused  on  the 
aquatic  enviroiunent.  Biodegradation  of  t  he  hydrocarbon  components  of  JP-4  was 
studied  by  Spain  and  workers  (1983).  In  these  studies,  JP-4  as  well  as  a  model  fuel 
made  up  of  known  quantities  of  individual  hydrocarbons  present  in  JP-4  were  added 
to  water  and  sediment  samples  from  several  aquatic  environments,  and  the 
disappearance  of  hydrocarbons  was  followed  over  several  days.  Use  of  aquatic 
samples  which  were  killed  by  the  addition  of  mercuric  chloride  allowed  comparison 
of  biotic  and  abiotic  removal  processes. 

Results  indicated  that  evaporation  was  the  major  removal  process  for  the  low 
molecular  weight,  volatile  hydrocarbons.  Addition  of  sediment  to  water  samples 
affected  the  removal  of  JP4  components  by  reducing  the  rate  of  volatilization.  For 
most  individual  hydrocarbons,  biodegradation  was  not  as  significant  for  removal  as 
was  evaporation.  For  those  hydrocarbons  which  were  susceptible  to  biodegradation, 
the  extent  of  biodegradative  removal  was  a  function  of  the  water  sample.  In  general, 
water  samples  taken  from  environments  which  had  seen  previous  exposure  to 
hydrocarbons  exhibited  a  greater  extent  of  biodegradation  than  water  samples  taken 
from  pristine  environm:nts. 

To  further  elucidate  the  role  of  biodegradation  in  removal  of  JP-4,  additional 
studies  were  conducted  by  Pritchard  and  coworkers  (1988)  using  a  quiescent  bottle 
system  designed  to  simulate  a  fuel  spill  on  a  quiet  body  of  water.  Results  again 
indicated  that  volatilization  was  the  primary  removal  process  with  lower  molecular 


weight  hydrocarbons  being  lost  at  a  more  rapid  rate  than  higher  molecular  weight 
hydrocarbons. 

The  primary  goal  of  the  recently  completed  study  undertaken  as  part  of  the 
Summer  Faculty  Research  Program  was  to  evaluate  the  processes  responsible  for  the 
removal  of  JP-8  from  the  environment.  In  addition  to  bottles  receiving  water  and 
water/sediment  slurries,  a  series  of  bottles  containing  soil  were  included  in  this  study. 
The  primary  removal  process  in  both  water  and  water/sediment  slurries  was 
identified  as  evaporation.  However,  in  the  case  of  soil,  biodegradation  represented 
a  significant  removal  process.  For  this  reason,  it  is  felt  that  parallel  studies  should 
be  undertaken  with  JP-4  since  comparable  laboratory  data  on  its  biodegradative 
potential  in  soil  are  lacking. 

The  enhanced  removal  of  hydrocarbons  exhibited  in  soils  containing  active 
miCTobial  populations  raises  the  possibility  of  utilizing  microorganisms  in  the 
remediation  of  contaminated  soil.  Bioremediation  has  received  considerable 
attention  as  a  cost  effective  means  of  removing  petroleum  (Bartha,  1986)  as  well  as 
a  wide  variety  of  other  organic  chemicals  (Wilson  et  al.,  1986;  Lee  at  al.,  1988). 
These  review  articles  record  numerous  examples  of  the  removal  of  petroleum 
contaminants  from  soil.  Most  of  these  studies  focus  on  the  engineering  design  of  the 
remediation  technology  and  the  degree  of  remediation  achieved;  in  other  words, 
bioremediation  is  treated  as  a  "black  box"  into  which  contaminated  soil  is  placed  and 
out  of  which  clean  soil  is  obtained.  Only  a  handful  of  studies  address  the  microbial 
ecology  of  contaminated  soils. 
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Thomas  and  coworkers  (1989)  studied  microbial  artivity  at  a  creosote  waste 
site  and  demonstrated  that  while  active  microbial  populations  were  present  in 
pristine,  slightly,  and  heavily  contaminated  soils,  mineralization  of  polynuclear 
aromatic  hydrocarbons  was  only  significant  in  the  contaminated  soils.  Thus,  the 
microbial  population  has  adapted  to  the  presence  of  the  contaminating  hydrocarbons 
by  increasing  its  biodegradative  potential.  A  similar  increase  in  numbers  and  activity 
of  hydrocarbon  degrading  organisms  was  observed  in  a  studies  of  microbial 
degradation  of  aromatic  hydrocarbons  at  a  hazardous  waste  site  (Lee  at  al.,  1984) 
and  microbial  activity  in  sediments  receiving  run-off  from  oil  sand  deposits 
(Wyndham  and  Costerton,  1981).  In  the  latter  case  and  in  a  study  of  phenol¬ 
degrading  activity  at  an  abandoned  hazardous  waste  site  (Dean-Ross,  1989),  no 
correlation  was  observed  between  the  extent  of  contamination  and  the  activity  of  the 
microbial  populations. 

While  the  above  studies  provide  information  essential  for  an  understanding 
of  the  microbial  ecology  of  contaminated  soils,  they  do  not  address  the  microbiology 
of  bioreraediation.  None  of  these  studies  followed  changes  in  microbial  abundance 
and  activity  during  an  actual  bioremediation  project.  The  proposed  study  was 
designed  to  provide  basic  data  on  microbial  abundance  and  activity  to  aid  in  the 
interpretation  of  engineering  data  from  bioremediation  projects. 
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OBJECTIVES 


A.  Overall  objective:  to  evaluate  the  biodegradability  of  Air  Force  jet  fuel  JP-4  in 
contaminated  soil. 

B.  Specific  objectives: 

1.  To  determine  the  rate  and  extent  of  biodegradation  of  JP-4  in  soil. 

Soil  contamination  of  jet  fuels  fi’om  accidental  spills  or  leaking  underground 
storage  tanks  represents  a  recurrent  input  of  hydrocarbons  into  the  terrestrial 
environment.  In  a  study  conducted  as  part  of  the  Summer  Faculty  Research 
Program,  it  was  found  that  JP-8,  a  jet  fuel  used  in  Europe  and  proposed  for  use  in 
the  United  States,  was  removed  faster  in  soil  containing  an  active  population  of 
microorganisms  than  in  inactive  soil,  indicating  that  biodegradation  is  a  significant 
factor  in  the  environmental  fate  of  jet  fuels.  Since  no  laboratory  data  is  available  for 
JP-4,  the  first  objective  will  remedy  this  lack  of  data. 

2.  To  determine  the  rate  and  extent  of  biodegradation  of  hydrocarbon 
components  of  JP-4  in  uncontaminated  soil,  contaminated  soil  and  soil  contaminated 
with  JP-4  but  treated  to  encourage  biodegradation. 

The  Environics  Division  of  the  Engineering  and  Services  Center  at  Tyndall 
AFB  will  be  initiating  a  field  study  entitled  Enhanced  Biodegradation  through  Soil 
Venting.  This  study  will  consist  of  four  experimental  plots,  two  contaminated  by  JP-4 
and  two  uncontaminated  by  fuel.  Moisture  content,  nutrient  concentration  and 
venting  rate  will  be  adjusted  in  the  experimental  plots  to  study  the  effect  of  these 
parameters  on  biodegradation.  As  part  of  this  proposal,  soil  will  be  collected  from 
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each  of  the  experimental  plots  and  biodegradation  will  be  assessed  in  the  laboratory 
by  the  measurement  of  release  of  radiolabeled  carbon  dioxide  after  the  addition  of 
representative  radiolabeled  hydrocarbons  to  the  soil  samples.  It  is  proposed  to  use 
hexadecane,  a  representative  straight  chain  alkane,  and  toluene  and  naphthalene, 
representative  aromatic  hydrocarbons. 

3.  To  characterize  the  microorganisms  responsible  for  mediating  the 
biodegradation  of  JP-4. 

The  abundance  and  activity  of  microorganisms  in  the  three  experimental  soils 
will  be  determined  in  order  to  assess  the  effect  of  treatment  on  the  microbial 
population.  In  addition,  microorganisms  capable  of  utilizing  each  of  the  three  model 
hydrocarbons  will  be  isolated  and  characterized  as  to  their  biodegradative  potential. 
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MATERIALS  AND  METHODS 


A-  Biodegradation  of  JP-4  in  soil 

Biodegradation  of  JP-4  was  assessed  using  a  procedure  similar  to  that  used  for 
the  assessment  of  JP-8  biodegradation.  For  this  study,  25  g  (dry  weight  equivalent) 
aliquots  of  soil  were  placed  in  150  ml  milk  dilution  bottles.  Two  sets  of  bottles  were 
used,  one  set  containing  untreated  (active)  soil  and  the  other  receiving  soil  treated 
with  2%  (wt)  HgClj.  Each  bottle  received  250  ul  of  JP-4.  Bottles  were  incubated 
in  a  horizontal  position  with  caps  removed.  Triplicate  bottles  were  removed  at  0 
time,  and  5,  10,  20  and  30  days.  Three  bottles  containing  25  g  of  soil  (dry  weight) 
were  weighed  and  incubated  under  identical  conditions  to  the  JP-4  bottles.  These 
bottles  were  weighed  weekly  to  calculate  weight  loss.  The  corresponding  amount  of 
water  was  added  to  the  JP-4  bottles  to  maintain  a  constant  moisture  level.  Soil  was 
extracted  by  addition  of  25  mis  of  CSj,  followed  by  shaking  for  5  min  on  a  wrist 
action  mechanical  shaker.  Extracts  were  decanted  from  the  soil,  laced  in  vials  and 
analyzed  by  high  resolution  capillary  gas  chromatography  with  a  Perkin-Elmer  Model 
8500  gas  chromatograph  equipped  with  a  flame  ionization  detector,  using  a 
Durabond  5  capillary  column  programmed  to  begin  at  40°  C  with  a  4  min  isr  thermal 
period  followed  by  3°/min  increase  to  25(f  C. 

B.  Biodegradation  of  hydrocarbon  components  of  JP-4  in  experimental  plots 

To  measure  soil  degradation  of  these  hydrocarbons,  a  gas  train  similar  to  that 
employed  by  Ward  (1985)  was  used.  Soil  was  placed  in  Erlenmeyer  flasks  attached 
to  a  gas  train  supplying  COj  free  air  at  a  controlled  rate.  Three  series  of  flasks  were 


27-9 


set  up,  one  for  each  of  three  model  hydrocarbons,  toluene,  hexadecane  and 
naphthalene.  JP-4  containing  a  spike  of  one  of  the  above  radiolabeled  compounds 
was  added  to  triplicate  samples  of  soil  from  each  experimental  plot.  As  the  air 
exited  the  flask,  it  passed  through  a  Tenax  AC  trap  (Heitkamp  et  al.,  1987)  to  collect 
volatilized  hydrocarbon  followed  by  a  CO2  trap.  Both  traps  were  sampled  at  intervals 
over  the  experimental  period  and  the  amount  of  radioactivity  in  each  measured  by 
liquid  scintillation  counting. 

C.  Characterization  of  microbial  populations  in  soils  from  bioventing  project  plots. 

At  initiation  of  the  soil  venting  project  (September,  1989)  and  at  termination 
of  the  project  (February,  1989),  soil  samples  from  the  four  experimental  plots  were 
collected,  transported  to  IPFW  and  analyzed  for  microbial  activity  by  the  following 
methods: 

1.  Direct  counts  using  epifluorescence  microscopy  according  to  the  method  of 
Balkwill  and  Ghiorse  (1985). 

2.  Viable  counts  using  PYTG  agar,  dilute  PYTG  agar  and  soil  extract  agar  as 
described  in  Balkwill  and  Ghiorse  (1985). 

3.  Heterotrophic  activity  as  measured  by  utilization  of  glucose  or  a  mixture  of 
amino  acids.  In  this  procedure,  a  ^^C-labeled  substrate  was  added  to  aliquots  of  soil 
from  the  experimental  plots  in  60  ml  serum  vials.  Five  replicate  vials  were  prepared 
per  treatment;  two  vials  received  0.5  ml  2N  HjSO^  to  serve  as  killed  cell  controls,  the 
other  three  assessed  the  activity  of  the  microbial  population.  Each  vial  received  the 
labeled  substrate.  Flasks  were  capped  with  rubber  stoppers  fitted  with  center  wells 
containing  0.1  ml  of  ION  NaOH  absorbed  onto  a  filter  paper  wick.  Vials  were 
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incubated  for  an  appropriate  time  interval.  At  the  end  of  the  incubation  period, 
reaction  in  the  active  vials  was  stopped  by  the  addition  of  0.5  ml  2N  H2SO4.  Vials 
were  incubated  with  shaking  for  an  additional  hour  to  ensure  complete  trapping  of 
CO2.  Wicks  were  removed,  placed  in  a  scintillation  cocktail  (Budgetsolve,  Research 
Products  International),  and  counted  by  liquid  scintillation  counting  (Beckman  Model 
LS  9800). 

4.  Hydrocarbon  mineralization  was  assessed  in  a  similar  fashion  using  each  of 
the  three  model  hydrocarbon  substrates. 

5.  Numbers  of  hydrocarbon  degraders  were  determined  using  the  Most 
Probable  Number  (MPN)  procedure  as  modified  by  Somerville  et  al  (1985)  for  each 
of  the  three  model  hydrocarbon  substrates. 
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RESULTS 


A.  Biodegradation  of  JP-4  in  soil 

Results  of  quiescent  bottle  tests  using  JP-4  in  Chippola  soil  are  shown  in 
Table  1.  Disappearance  of  twenty  hydrocarbon  components  of  JP-4  were  followed 
over  a  two  week  period.  For  hydrocarbons  having  boiling  points  below  that  of 
undecane,  essentially  all  of  the  hydrocarbon  disappeared  from  flasks  by  day  5. 
Hydrocarbons  with  boiling  points  higher  than  undecane  were  removed,  but  although 
not  to  the  same  extent  as  the  lower  boiling  point  hydrocarbons.  At  day  5,  no 
sigificant  differences  were  noted  when  comparing  disappearance  in  flasks  containing 
soil  treated  with  HgCl2  as  compared  to  untreated  soil.  However,  ast  day  10, 
hydrocarbon  concentrations  for  the  longer  chain  alkanes  were  lower  in  untreated 
than  treated  soils,  indicating  that  biodegradation  did  play  a  significant  role  in 
removal  of  these  hydrocarbon  components. 

B.  Biodegradation  of  model  hydrocarbons  in  soil  from  treatment  plots 

Toluene  was  removed  from  soil  flasks  by  evaporation  as  indicated  by 
entrapment  of  radiolabeled  toluene  on  the  resin  filters  by  the  second  sampling  day. 
No  radiolabel  was  recovered  in  the  KOH  traps. 

In  soil  samples  taken  from  treatment  plots  at  the  beginning  of  the 
bioremediation  project,  hexadecane  was  biodegraded  in  the  soil  from  the 
uncontaminated  site  as  indicated  in  Figure  1.  No  significant  biodegradation  was 
observed  in  soil  taken  from  the  contaminated  site.  Naphthalene  was  biodegraded  in 
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both  soils  to  approximately  the  same  extent. 

In  soil  samples  taken  from  treatment  plots  at  the  end  of  the  bioremediation 
project,  biodegradation  was  observed  only  in  the  case  of  hexadecane  in 
uncontaminated  soil  (Fig.  2). 

C.  Characterization  of  microbial  populations  in  soils  from  treatment  plots 

At  the  beginning  of  bioremediation,  viable  counts,  direct  counts,  numbers  of 
hydrocarbon  degraders  and  hydrocarbon  degrading  activity  was  higher  in  soil  from 
the  contaminated  site  than  the  uncontaminated  site.  Heterotrophic  activity,  however, 
was  higher  in  the  uncontaminated  than  the  contaminated  soil. 

At  the  end  of  bioremediation,  viable  counts,  direct  counts  and  heterotrophic 
activity  were  higher  in  the  uncontaminated  than  the  contaminated  soil.  The  number 
of  hexadecane  degraders  was  slightly  higher  in  the  uncontaminated  than  the 
contaminated  soil.  No  naphthalene  or  toluene  degraders  were  detected  in 
uncontaminated  soil.  Hydrocarbon  degrading  activity  was  below  the  detection  limit 
in  both  contaminated  and  uncontaminated  soils. 
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RECOMMENDATIONS 


1.  Biodegradation  is  a  factor  in  the  removal  of  high  boiling  point  hydrocarbon  ^ 

components  of  jet  fuel.  Bioremediation  projects  therefore  have  the  highest  chance 

of  success  in  focusing  on  these  hydrocarbons. 

2.  Biodegradation  of  hyxadecane  was  lower  in  hydrocarbon  contaminated  soil  * 

than  in  uncontaminated  soil,  in  spite  of  higher  numbers  of  degraders.  This  suggests 

that  environmental  factors  may  be  limiting  biodegradation  in  field  situations. 

3.  Hydrocarbon  contamination  enriches  for  a  microbial  population  containing 
significant  numbers  of  hydrocarbon  degrading  bacteria.  Manipulation  of  this 

! 

population  to  maximize  the  rate  of  biodegradation  may  be  a  cost  effective  means  of 

I 

enhancing  removal  of  fuel  from  contaminated  soil. 


* 


27-14 


REFERENCES 


Balkwill,  D.  L.  and  W.  C.  Ghiorse.  1985.  Characterization  of  subsurface  bacteria 
associated  with  two  shallow  aquifers  in  Oklahoma.  Appl.  Environ.  Microbiol. 
50:580-588. 

Bartha,  R.  1986.  Biotechnology  of  petroleum  pollutant  biodegradation.  Microb. 
Ecol.  12:155-172. 

Carlson,  R.  E.  1981.  The  biological  degradation  of  spilled  jet  fuels:  a  literature 
review.  USAF/ESC  Report  ESL-TR-81-50. 

Cook,  A.  M.,  H.  Grossenbacher,  and  R.  Hutter.  1983.  Isolation  and  cultivation  of 
microbes  with  biodegradative  potential.  Experientia  39:1191-1198. 

Haigler,  B.  E.,  S.  F.  Nishino,  and  J.  C.  Spain.  1988.  Degradation  of  1,2- 
dichlorobenzene  by  a  Pseudomonas  sp.  Appl.  Environ.  Microbiol.  54:294-301. 

Heitkamp,  M.  A.,  J.  P.  Freeman  and  C.  E.  Cemiglia.  1987.  Naphthalene 
biodegradation  in  environmental  microcosms:  estimates  of  degradation  rates  and 
characterization  of  metabolites.  Appl.  Environ.  Microbiol.  53:129-136. 

Lee,  M.  D.,  J.  M.  Thomas,  R.  C.  Borden,  P.  B.  Bedient,  C.  H.  Ward,  and  J.  T. 


27-15 


Wilson.  1988.  Biorestoration  of  aquifers  contaminated  with  organic  compounds. 
CRC  Crit.  Rev.  Environ.  Control  18:29-89. 

Lee,  M.  D.,  J.  T.  Wilson,  and  C.  H.  Ward.  1984.  Microbial  degradation  of  selected 
aromatics  in  a  hazardous  waste  site.  Devel.  Ind.  Microbiol.  25:'^57-565. 

Pritchard,  P.  H.,  T.  P.  Maziarz,  L.  H.  Mueller,  and  A.  W.  Bourquin.  1988. 
Environmental  fate  and  effects  of  shale-derived  jet  fuel.  USAF/ESC  Report  ESL- 
TR-87-09. 


Spain,  J.  C.  and  C.  C.  Somerville.  1985.  Fate  and  toxicity  of  high  density  missile 
fuels  RJ-5  and  JP-9  in  aquatic  test  systems.  Chemosphere  14:239-248. 

Spain,  J.  C.,  C.  C.  Somerville,  L,  C.  Butler,  T.  J.  Lee,  and  A.  W.  Bourquin.  1983. 
Degradation  of  jet  fuel  hydrocarbons  by  aquatic  microbial  communities.  USAF/ESC 
Report  ESL-TR-83-26. 

Somerville,  C.  C.,  C.  A  Monti,  and  J.  C.  Spain.  1985.  Modification  of  the  ^'‘C  most- 
probable-number  method  for  use  with  nonpolar  and  volatile  substrates.  Appl. 
Environ.  Microbiol.  49:711-713. 

Thomas,  J.  M.,  M.  D.  Lee,  M.  J.  Scott  and  C.  H.  Ward.  1989.  Microbial  ecology  of 


the  subsurface  at  an  abandoned  creosote  waste  site.  J.  Ind.  Microbiol.  4:109-120. 


Ward,  T.  E.  1985.  Characterizing  the  aerobic  and  anaerobic  microbial  activities  in 
surface  and  subsurface  soils.  Environ.  Toxicol.  Chem.  4:727-737. 

Wilson,  J.  T.,  L.  E.  Leach,  M.  Henson,  and  J.  N.  Jones.  1986.  In  situ  biorestoration 
as  a  ground  water  remediation  technique.  Ground  Water  Monitor.  Review.  Fall, 
1986:56-64. 


27-17 


TABLE  1.  REMOVAL  OF  HYDROCARBONS  FROM  UNTREATED  SOIL 


Hydrocarbon 

Concentration  in  extract  (ug/ml)  at; 

O  time 

5  day 

10  day 

15  day 

2,3-dimethylpentane 

.061 

0 

0 

0 

heptane 

.277 

0 

0 

0 

methylcyclohexane 

.094 

0 

0 

0 

toluene 

.087 

0 

0 

0 

3-methylheptane 

.136 

0 

0 

0 

octane 

.197 

0 

0 

0 

ethylbenzene 

.060 

0 

0 

0 

m-xylene 

.067 

0 

0 

0 

o-xylene 

.034 

0 

0 

0 

nonane 

.118 

0 

0 

0 

iso-butylbenzene 

.028 

0 

0 

0 

1,3,5-trimethylbenzene 

.035 

0 

0 

0 

1 ,2,4-trimethylbenzene 

.068 

0 

0 

0 

decane 

.102 

0 

0 

0 

undecane 

.108 

.018 

.012 

.011 

dodecane 

.111 

.056 

.016 

.014 

tridecane 

.099 

.067 

.022 

.018 

tetradecane 

.057 

.067 

.020 

.019 

pentadecane 

.027 

.032 

.017 

.016 

hexadecane 

.015 

.010 

.008 

0 
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TABLE  2.  REMOVAL  OF  HYDROCARBONS  FROM  TREATED  SOIL 


Hydrocarbon 

Concentration  in  extract  (ug/ml)  at: 

0  time 

5  day 

10  day 

15  day 

2,3-dimethylpentane 

.050 

0 

0 

0 

heptane 

.235 

0 

0 

0 

methylcyclohexane 

.081 

0 

0 

0 

toluene 

.091 

0 

0 

0 

3-methylheptane 

.121 

0 

0 

0 

octane 

.177 

0 

0 

0 

ethylbenzene 

.057 

0 

0 

0 

m-xylene 

.063 

0 

0 

0 

o-xylene 

.033 

0 

0 

0 

nonane 

.110 

0 

A 

\J 

0 

iso-butylbenzene 

.027 

0 

0 

0 

1,3,5-trimethylbenzene 

.035 

0 

0 

0 

1,2,4-trimethylbenzene 

.057 

0 

0 

0 

decane 

.095 

0 

0 

0 

undecane 

.102 

.018 

.010 

.006 

dodecane 

.104 

.056 

.015 

.017 

tridecane 

.094 

.070 

.043 

.029 

tetradecane 

.056 

.065 

.049 

.029 

pentadecane 

.031 

.038 

.032 

.018 

hexadecane 

.015 

.010 

.008 

.005 
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TABLE  3.  ABUNDANCE  AND  ACTIVITY  OF  THE  MICROBIAL 
COMMUNITY  AT  THE  BEGINNING  OF  BIOREMEDIATION 


Soil  Treatment 


Contaminated 

Uncontaminated 

Viable  counts 
(per  g  soil) 

2.48  X  10^ 

1.97  X  10^ 

Direct  counts 
(per  g  soil) 

3.02  X  10* 

1.34  X  10* 

Hydrocarbon  degraders 
(per  g  soil) 

Hexadecane 

5.20  X  10*^ 

8.32  X  10^ 

Naphthalene 

8.32  X  10^ 

8.32  X  10^ 

Toluene 

8.32  X  10*^ 

8.32  X  lO'* 

Heterotrophic  activity 
(turnover  time  in  hours) 

96.0 

51.6 

Hydrocarbon-degrading 
activity  (turnover  time 
in  hours) 

Hexadecane 

ND 

ND 

Naphthalene 

459.9 

977.2 

Toluene 

2074 

ND 

ND  =  none  detected 


TABLE  4.  ABUNDANCE  AND  ACTIVITY  OF  THE  MICROBIAL 
COMMUNITY  AT  THE  END  OF  BIOREMEDIATION 


Soil  Treatment 


Contaminated 

Uncontaminated 

Viable  counts 
(per  g  soil) 

3.19  X  10^ 

5.74  X  10"* 

Direct  counts 
(per  g  soil) 

1.35  X  10* 

8.88  X  10* 

Hydrocarbon  degraders 
(per  g  soil) 

Hexadecane 

2.29  X  10* 

3.47  X  10* 

Naphthalene 

5.2  X  10* 

ND 

Toluene 

5.2  X  10^ 

ND 

Heterotrophic  activity 
(turnover  time  in  hours) 

217.0 

9.40 

Hydrocarbon-degrading 
activity  (turnover  time 
in  hours) 

Hexadecane 

ND 

ND 

Naphthalene 

ND 

ND 

Toluene 

ND 

ND 

ND  =  none  detected 
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FRACl  ION  UTILIZED 
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Figure  1.  Hydrocarbon  biodegradation  at  the  beginning  of  bioremediation. 
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FRACTION  UTILIZED 
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Figure  2.  Hydrocarbon  biodegradation  at  the  end  of  bioremediation. 
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ABSTRACT 

Commercial  airports  and  Air  Force  bases  throughout  the  United 
States  train  personnel  to  extinguish  fires  associated  with  aircraft 
flight  operations.  A  waste  stream  consisting  of  jet  fuel,  surfactant- 
based  firefighting  agents,  and  particulates  is  created  by  these 
necessary  exercises.  The  Air  Force  is  developing  a  treatment  scheme  for 
these  wastes. 

This  project  evaluated  two  pretreatment  approaches  for  reducing 
foaming  potential  and  enhancing  biodegradability  of  the  wastewater. 
First,  chemically  coagulation  was  examined  as  a  way  of  surfactant 
removal  because  they  are  colloidal  in  nature.  Aluminum  sulfate  and 
ferric  chloride  were  studied  as  the  primary  coagulants.  Chemical 
oxidation  was  the  second  approach  investigated  as  a  way  to  alter  the 
foaming  and  bioinhibitory  character  of  the  surfactants.  Hydrogen 
peroxide  and  potassium  permanganate  were  used  for  this  phase  of  the 
project.  A  synthetic  waste  and  grab  samples  from  three  training 
facilities  were  evaluated  for  treatability  by  both  approaches. 

The  results  show  that  the  surfactants  in  the  wastewater  can  not  be 
removed  through  the  coagulation/flocculation  process.  However,  an 
interesting  foam  inhibition  effect  was  observed  with  the  addition  of  the 
coagulant  and  adjustment  of  pH.  In  particular,  if  the  pH  was  held  to 
about  5.0  units,  foam  generation  did  not  occur,  but  only  if  a  small 
amount  of  alum  haa  been  previously  added. 
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Regarding  chemical  oxidation,  hydrogen  peroxide  proved  of  no 
value.  However,  the  addition  of  potassim  permanganate  was  found  to 
permanently  eliminate  foam  generation  potential  of  the  wastewaters 
evaluated.  This  was  achieved  at  moderate  doses  of  permanganate  if  the 
pH  was  lowered  to  a  level  of  6.5  units.  However,  the  biodegradability 
and  effective  organic  content  of  the  samples  was  not  significantly 
altered. 
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PRETREATMENT  OF  WASTEWATERS  GENERATED  BY 
FIREFIGHTER  TRAINING  FACILITIES 


by 

Dennis  D.  Truax,  Ph.D.,  P.E. 

INTRODUCTION 


Training  personnel  in  the  techniques  of  dealing  with  aircraft 
fires  is  an  important  activity  at  airports.  In  addition  to  the  numerous 
training  facilities  at  commercially-operated  airports,  the  Air  Force  has 
over  one  hundred  firefighter  training  sites  located  at  military 
airfields. 

Recently  the  Fire  Technology  Branch  (RDCF)  of  the  Air  Force 
Engineering  and  Services  Center  (AFESC)  designed  an  environmentally 
secure  firefighter  training  facility  (FTF) .  In  general,  the  FTF 
consists  of  a  circular  bum  pit,  a  vehicle  maneuvering  area,  fuel 
storage  and  supply  system,  smoke  reduction  apparatus,  and  a  wastewater 
handling  area.  The  bum  pit  has  plastic  and  clay  liners.  These  prevent 
the  migration  of  liquid  to  the  surrounding  soil  and  underlying 
groundwater.  After  a  training  activity,  the  pit  will  contain  a  mixture 
of  water,  unburned  aviation  fuel  (i.e.,  JP-4),  soot  particles,  and  a 
fire  extinguishing  agent  consisting  principally  of  aqueous  film-forming 
foam  (AFFF).[l-4]  This  combination  of  liquids  and  solids  constitutes  a 
wastewater  'hich  must  be  treated  before  release  into  the  environment. 

At  present,  the  specifications  for  the  wastewater  handling  system 
call  for  an  oil/water  separator.  This  unit  was  designed  to  reduce  the 
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oil  and  grease  content  of  the  wastewater  to  or  below  25  mg/1  and  allow 
recovery  of  unbumed  fuel.  [1]  At  the  same  time,  particulates  will  be 
removed  to  degree  as  yet  undetermined.  Therefore,  the  waste  stream 
from  the  separator  will  consist  primarily  of  water  laden  with  AFFF  i 
water-miscible  hydrocarbons  from  the  unburned  fuel. 

The  aqueous  effluent  from  the  separator  will  then  enter  an 
impermeable  holding  basin  designed  in  such  a  way  to  enhance  evaporation. 
In  the  event  that  "...unfavorable  meteorological  conditions  (high 
precipitation  and  low  evaporation)  cause  the  basin  tc  exceed  its  design 
capacity...",  the  wastewater  will  have  to  be  removed  and  undergo  further 
treatment. [1]  In  that  many  regions  experience  rates  of  precipitation 
that  are  greater  than  the  rate  of  evaporatio  i,  the  Environics  Division 
(RDVW)  of  AFESC  is  charged  with  evaluation  of  appropriate  wastewater 
treatment  schemes.  ^'’1 

Though  the  exact  chemical  composition  of  AFFF  is  proprietary  and 
varies  between  manufacturers.  It  is  known  that  AFFF  consists  of  a  mix¬ 
ture  of  fluorocarbon  and  non-fluorocarbon  surfactants,  solubilizers,  and 
water.  "Tie  surfactant  molecules  impart  to  the  wastewater  properties 
such  as  foaming,  emulsification,  and  particle  suspension. [5]  Further, 
these  chemicals  inhibit  biological  activity  when  present  in  sufficient 
concentration. [3,6] 

JP-4  is  a  classification  of  the  jet  fuel  new  utilized  by  the  Air 
Force.  It  is  the  fuel  used  for  Air  Force  FTF  activities.  JP-4  is 
composed  of  several  potentially  hazardous  organics  including  the 
priority  pollutants  toluene,  ethylbenzene,  and  napthalene.  However,  the 
water-miscible  components  of  JP-4  are  primarily  low  molecular  weight 
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cycloalkanes  and  aromatics.  A  large  portion  of  these  compounds  are  lost 
during  FTF  activities  due  to  their  lew  vapor  pressures.  Therefore,  the 
fire  training  facility  wastewater  can  be  expected  to  predominantly 
contain  higher  molecular  weight  alkanes  that  normally  have  very  low 
solubility  values. [7] 

Various  treatment  alternatives  exist  which  might  be  able  to  handle 
these  materials.  These  choices  can  be  divided  into  two  broad 
classifications,  segregated  and  joint  treatment.  In  segregated 
treatment,  a  separate  and  complete  wastewater  processing  scheme  would  be 
developed  to  provide  an  effluent  which  could  be  discharge  into  the 
environment.  Zachritz [2]  and  ChantS]  have  examined  various  systems 
finding  mixed  results.  Further,  given  the  highly  variable  nature  of 
waste  stream  flows  and  characteristics,  consistent  operation  of 
segregated  systems  may  be  extremely  difficult. 

Joint  treatment  is  the  discharge  of  wastewater  into  a  publicly 
owned  treatment  works  (POIW)  or  the  sewerage  system  of  the  authority 
operating  the  FTF;  i.e.,  airbase  sewer  or  treatment  plant.  Based  on 
data  from  a  recent  survey [8],  over  forty  percent  of  Air  Force  installa¬ 
tions  are  using  POTWs  to  treat  their  wastewaters.  It  is  anticipated 
that  virtually  all  of  these  facilities  incorporate  biological  treatment 
processes  which  could  be  adversely  impacted  by  the  addition  of  untreated 
FTF  wastewater.  Of  those  airbases  still  operating  treatment  facilities, 
about  eighty  percent  use  biological  processes.  Of  these,  many  are 
having  trouble  meeting  treatment  requirements  because  of  age  or  design. 
The  addition  of  untreated  FTF  wastewaters  could  aggravate  this  problem. 
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If  joint  treatment  at  POTWs  is  practiced,  pretreatment  of  this 
wastewater  will  be  required  under  current  federal  regulations.  This 
would  be  due,  in  part,  to  the  potential  of  these  discharges  to  cause  two 
problems.  The  first  is  excessive  foaming  in  the  treatment  plant's 
reactors  and  effluent.  The  other  is  a  reduction  is  effluent  quality 
resulting  from  a  bioinhibitory  nature  of  the  wastewaters  components 
and/ or  overloading  of  biological  reactors  unacclimated  to  these 
components.  Both  problems  have  been  observed  during  laboratory  testing 
of  wastewater  samples  from  FTFs  as  well  as  investigations  into  its 
components. [3, 4, 6, 9] 

This  study  is  an  initial  examination  of  selected  pretreatment 
strategies  for  wastewaters  generated  by  Air  Force  FTFs.  It  seeks  to 
determine  if  chemical  coagulation  and  chemical  oxidation  are  viable 
pretreatment  techniques  for  reducing  foaming  potential  and  increasing 
biodegradability  of  these  wastewaters.  Some  work  has  been  performed  in 
this  area. [3 , 9, 10]  However,  much  of  the  data  collected  for  FTF  wastes 
is  dated.  The  chemicals  and  operational  procedures  were  different  from 
those  used  today.  Some  of  the  information  reported  seems  contradictory. 
Finally,  these  projects  did  not  thoroughly  examine  alternatives  which 
might  have  application  to  pretreatment. 
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OBJECTIVES  OF  THE  RESEARCH  EFFORT 


The  primary  goal  of  pretreating  this  wastewater  is  the  reduction 
of  the  foaming  potential.  A  secondary  goal  is  improving  the  potential 
for  bacterial  conversion  of  the  organics  contained  in  it.  The  litera¬ 
ture  shows  there  are  a  number  of  alternatives  for  removing  oils  and 
surfactants. [11-13]  These  processes  include  chemical  oxidation  and 
chemical  coagulation/flocculation.  Because  these  processes  are 
appropriate  for  treating  the  components  of  FTF  wastewaters,  it  is 
logical  to  expect  that  they  would  have  application  in  pretreatment  of 
the  waste  stream.  Previous  research  efforts  would  appear  to  support 
this. [3,9, 10]  This  project  will  examine  each  of  these  in  some  detail. 
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BACKGROUND 


AFFF  is  an  effective  fire  suppressant  due,  in  part,  to  the  surface 
active  agents  contained  in  it.  Surfactants  are  materials  which  dis¬ 
solve,  or  tend  to  dissolve,  in  water  and  non-aqueous  materials.  A  sur¬ 
factant  molecule  contains  a  strongly  hydrophobic  group  and  a  strongly 
hydrophilic  one.  Such  molecules  tend  to  congregate  at  interfaces 
between  the  aqueous  medium  and  other  phases  of  the  system  (e.g.,  air, 
oily  liquids,  particulates).  They  lower  the  surface  tension  of  the 
water  at  these  interfaces  thus  imparting  properties  such  as  foaming, 
emulsification,  and  particle  suspension.  [5]  To  reduce  this  effect,  one 
must  remove  or  chemical  modify  the  surfactant  in  a  way  which  eliminates 
its  hydrophilic  and/or  hydrophobic  property. [7,  14-16] 

Chemical  Coagulation 

Surfactants  are  colloidal  in  nature.  It  is  customary  to  distin¬ 
guish  two  classes  of  colloids  whose  general  behavior  is  entirely  differ¬ 
ent.  These  classes  are  called  lyophobic  and  lyophilic  colloids. 
Lyophilic  colloids  have  a  strong  affinity  for  the  dispersing  solvent 
while  in  lyophobic  colloids  this  attraction  is  weak  or  absent.  As 
discussed  above,  surfactants  in  water  have  both  lyophilic  and  lyophobic 
components. 

In  most  systems,  colloids  are  held  in  suspension,  or  stabilized, 
because  of  electrostatic  forces  which  they  have,  or  develop,  relative  to 
the  surrounding  water.  [17]  In  particular,  colloids  of  similar  nature 
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will  develop  similar  charges.  Bodies  having  similar  charges  repeal  each 
other.  Due  to  this  repulsive  forces,  the  colloids  remain  in  suspension. 
Further,  the  polar  nature  of  the  water  molecule  results  in  an  attraction 
of  the  colloid  to  the  bulk  liquid  phase  which  retards  separation. 

The  parameters  governing  flocculation  of  dispersed  droplets  of  an 
emulsion  and  dispersed  solid  particles  are  the  same. [15,  17,  18]  The 
term  stability,  when  applied  to  emulsions,  refers  to  the  resistance  of 
the  dispersed  droplets  to  coalescence.  The  floating  or  settling  of  the 
droplets  because  of  a  difference  in  density  between  them  and  the 
continuous  phase  is  not  considered  instability. 

An  important  physical  property  of  colloidal  dispersions  is  their 
tendency  to  aggregate.  Collisions  between  dispersed  colloids  occur  fre¬ 
quently  as  a  result  of  Brownian  motion.  Attractive  forces,  termed  van 
der  Walls'  forces,  exist  between  all  colloidal  particles  regardless  of 
dissimilar  chemical  natures.  [19]  These  attractive  forces  are  responsi¬ 
ble  for  the  colloidal  aggregation.  However,  their  magnitude  depends 
upon  the  kinds  of  atoms  which  make  up  the  colloidal  particles  and  the 
density  of  the  particles.  Also,  the  attractive  force  decreases  with 
distance  separating  the  particles. 

When  considering  the  aggregation  of  colloids  it  is  useful  to 
distinguish  between  colloidal  transport  and  colloidal  destabilization. 
Colloidal  transport  is  a  physical  process  related  to  the  probability 
that  the  colloid's  kinetic  energy  is  sufficient  to  overcome  the 
repulsive  electrostatic  energy  barrier.  This  is  accomplished  by 
phenomena  such  as  Brownian  diffusion,  fluid  motion,  and  sedimentation. 
It  is  controlled  by  such  physical  parameters  as  temperature,  velocity 
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gradient,  and  colloidal  size.  The  destabilization  of  colloidal 
suspensions  is  achieved  by  reducing  the  energy  of  interaction  between 
two  colloids  and  by  formation  of  chemical  bridges  which  aggregate 
colloids  into  three-dimensional  floe  networks. [19,  20] 

The  metal  ions  used  for  coagulation  aid  in  the  latter  mechanism. 
Through  considerations  of  ionic  strength,  the  addition  of  highly  charged 
metals  reduces  the  energy  of  interaction.  Then,  as  the  metals 
hydrolyzed  in  water  to  form  polynuclear  complexes,  they  aggregate  with 
the  destabilized  particles  due  to  van  der  Waals  forces.  This  process 
can  be  aided  by  agitation  of  the  bulk  liquid.  This  causes  the  particles 
to  come  in  close  vicinity  and  increases  the  chances  that  they  will 
collide  and  coalesce. [17] 

The  most  widely  used  coagulants  in  water  and  wastewater  treatment 
are  aluminum  and  iron  salts.  Aluminxm  sulfate  is  employed  more  fre¬ 
quently  because  it  is  usually  cheaper.  Iron  has  the  advantages  of  form¬ 
ing  a  denser,  rapid  settling  floe  over  a  wider  pH  ranges. [17]  The  best 
pH  range  when  using  alum  is  A. 5  to  8.0  due  to  the  low  solubility  of 
aluminum  hydroxide  within  this  range.  For  ferric  chloride  a  broader  pH 
range  of  A  to  12  is  optima!. 

Chemical  Oxidation 

Ladbury  and  Cullis[22]  suggest  two  principle  mechanisms  exist  by 
which  organic  compounds  are  oxidized  by  acid  permanganate.  The  first  is 
direct  oxidation  by  the  permanganate  ion.  The  second  involves  formation 
of  an  intermediate  Mn  ion  and  subsequent  oxidation  by  hydroxyl 
radicals  produced  from  this  ion  and  water.  For  direct  oxidation  to 


proceed,  the  organic  should  have  a  hydrogen  atom  amenable  to 

4+ 

substitution  and/or  oxidation.  No  such  site  seems  mandated  if  the  Mn 
intermediate  provides  oxidation  because  this  processes  exhibits  a  higher 
activation  energy  than  direct  oxidation.  Though  the  kinetics  of 
permanganate  oxidation  of  organic  compounds  can  be  first-order 
initially,  this  relationship  can  not  be  maintained.  In  many  instances 
second-order  kinetics  are  exhibited  throughout  acid  permanganate 

oxidation. 

Probably  the  most  common  oxidative  process  associated  with  perman¬ 
ganate  is  syn  hydroxylation  of  alkenes.  [23,  24]  This  process  produces 
glycols  through  the  formation  of  cyclic  intermediates  followed  by  cleav¬ 
age  of  the  oxygen-metal  bond  as  illustrated  below: 

0  0  -C - C- 

C=C  +  Mn  - >  0  0  — ->  -c— C-  +  MnO^ 

0  0~  Mn  OH  OH 

0  o" 

Limiting  the  reaction  to  the  hydroxylation  reaction  illustrated 
above  is  very  difficult  under  normal  conditions.  Therefore,  it  is  pre¬ 
sumed  that  further  oxidation  of  the  glycol  occurs  resulting  in  the 
destruction  of  the  organic  molecule. 

When  dealing  with  FTF  wastewaters,  the  most  significant  surfactant 
is  of  the  linear  alkyl  sulfonate  type.  Typically,  the  aggregate 
surfactant  material  contains  both  a  hydrocarbon  and  fluorohydrocarbon 
surfactant  specie  bonded  to  a  glycol  carrier. [21]  Alkyl  sulfonates 
having  a  branched  alkyl  residue  are  formed  by  direct  sulfonation  of 
naphthenic  and  branched  paraffin  hydrocarbons.  It  is  suggested  that  the 
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sulfonation  of  these  tertiary  olefins  produces  a  product  with  a  radical 
substitution  at  an  activated  methyl  group  while  maintaining  the 
integrity  of  the  double  bond  as  illustrated  below:  • 

R'  F 

I 

R-CH=C  +  SO^  __>  R-CH=C 

CH^  CH^-SO^H  • 

The  presence  of  the  double  bond  as  present  in  the  product  of  this  reac¬ 
tion  provides  a  reactive  site  for  direct  permanganate  oxidation. 

Research  has  shown  that  oxidation  of  aromatic  hydrocarbons  in  an  i 

acidic  environment  requires  reduction  of  the  permanganate  ion  to  man¬ 
ganese  dioxide  and  oxidation  of  the  aromatic  ring  occurs  concurrently 
with  the  side  chain  destruction. [22-24]  Further,  the  extent  of  ring 
oxidation  decreases  as  the  size  of  the  alkyl  group  in  the 
monoalkylbenzenes  increases.  This  facts  imply  that  direct  permanganate 
oxidation  would  be  slow  to  react  with  fuel  hydrocarbons.  ' 

Merz  and  Waters  [25]  examined  the  reaction  of  the  free  hydroxyl 
radicals  produced  by  hydrogen  peroxide  acting  on  ferrous  iron.  They 
determined  that  the  complete  reaction  between  ferrous  ion  and  hydrogen 
peroxide  occurs  in  two  steps  as  illustrated  below. 

Fe^"^  +  H2O2  - >  Fe^"^  +  (H0;)~  +  'OH 

and 

Fe^"*"  +  'OH  - ►  Fe^"^  +  (HO:)” 

They  further  found  this  radical,  when  in  an  acidified  solution, 
capable  of  oxidizing  water-soluble,  aliphatic  compounds  through  dehydro¬ 
genation.  They  indicate  the  dehydrogenation  mechanism  was  followed  by 
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chain  and/or  non-chain  reactions  which  produce  alkenes  and  alcohols  in 
some  cases  and  refractory  products  in  others. 

If  the  dehydrogenation-oxidation  reactions  occur,  they  can  be 
sustained  with  a  small  amount  of  ferrous  salt.  This  results  in  the 
destruction  of  large  amounts  of  organic  material  with  only  a  little 
catalyst  required.  If  oxidation  of  the  organic  can  not  occur  without 
catalyst  being  added,  then  the  dehydrogenated  radical  stabilizes  and  the 
reaction  stops. [25] 

Finally,  it  is  important  to  emphasize  that  these  reactions  must  be 
performed  under  acidic  conditions  to  prevent  the  ionization  of  the 
molecular  hydrogen  peroxide.  Sims  recommends  an  initial  pH  of  4.0  units 
and  a  10  mg/L  dose  of  ferrous  ammonium  sulfate  (FAS)  catalyst. [26] 

Foam  Measurement  Techniques 

Emulsions  developed  by  combining  surfactants  and  water  have  a  high 
potential  for  foaming.  When  this  solution  enters  an  aeration  basin, 
foam  is  produced.  If  this  tank  is  a  biological  reactor  at  a  POIW,  oper¬ 
ational  problems  develop;  e.g.,  reduced  aeration  efficiency,  floating 
solids,  and  slippery  foam  covering  gantries  and  reactor  systems. 

A  significant  component  of  providing  pretreatment  of  FTF  waste 
streams  is  accurate  quantification  of  foaming  potential.  Without  a  re¬ 
producible  method  of  defining  the  foaminess  of  the  waste,  it  is  impos¬ 
sible  to  define  treatment  requirements  or  efficiencies.  The  literature 
presents  a  variety  of  foam  measurement  techniques.  However,  many  of 
these  are  nor,  suitable  for  this  application  due  to  method  specificity  or 
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FTF  waste  characteristics  (e.g.,  suspended  solids,  turbidity,  interfer¬ 
ing  complexes) . 

Bikerman' s [14]  work  was  aimed  at  establishing  foaminess  as  a 
definite  physical  property  of  a  liquid,  which  must  therefore  be 
independent  of  the  apparatus  used  and  the  amount  of  material  employed  in 
measuring  it.  The  approach  he  found  most  successful  employed  a  dynamic 
foam  meter.  For  this  procedure,  an  inert  gas  is  delivered  at  a  measured 
rate  through  a  porous  membrane  to  an  overlying  layer  of  solution.  The 
foam  develops  until  a  constant,  maximum  volume  is  reached. 

Using  this  procedure,  the  average  lifetime  before  bursting  of  a 
bubble  in  the  foam  describes  the  foaminess  of  the  liquid.  Designated  as 
Sigma,  this  characteristic  is  determined  using  the  following  equation; 


Sigma 


Vo  •  t 


V 


where:  Vo  is  the  maximum,  steady-state  volume  of  foam  produced,  t  is  the 
time  it  takes  to  reach  Vo,  and  V  is  the  total  volume  of  gas  used  to  gen¬ 
erate  the  foam  volume. 

This  approach  is  the  basis  of  most  recent  work  on  foaming  measure¬ 
ment.  These  efforts  have  examined  procedural  variables  including  geome¬ 
try  of  the  foam  reactor,  volume  of  solution  sparged,  type  of  sparger, 
and  rate  of  gas  delivery.  This  literature  indicates  that  no  single  set¬ 
up  will  provide  results  for  one  water  that  are  translatable  to  data  for 
another.  It  also  demonstrates  that  reproducible  results  can  be  obtained 
if  the  same  surfactant (s)  is  measured,  if  gas  flow  rates  are  moderate  to 
low,  and  if  sidewall  effects  of  the  reactor  are  kept  to  a  minimum. 
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GENERAL  RESEARCH  APPROACH 


During  this  evaluation  of  FTP  waste  stream  pretreatment,  the  pri-, 
mary  constraint  was  reduction  in  the  foaming  potential  of  the  wastejr., 
water.  Enhancement  of  bacterial  activity  was  viewed  as  a  secondary  cr%l 
teria.  This  set  of  priorities  was  established  for  various  reasons. 
First,  it  was  felt  there  was  a  good  probability  the  FTF  waste  would  be 
treated  at  a  POIW  which  incorporated  aeration  as  a  process.  Secondly, 
extensive  work  has  been  performed  to  define  limits  on  waste  input  tjg_^ 
biological  systems.  It  appears  that,  with  proper  control  over  the  bac^T 
terial  population  and  their  environment,  conventional  POIVJs  can  restSSSf 
the  organics  from  fire  fighter  training  facility  wastewaters. [27, 28]  ^ 

the  other  hand,  development  of  foam  from  FTF  wastes  would  disrupt  norme^ 
operation  of  the  POIW. 


S?.mples 

This  project  examined  pretreatment  of  four  wastewaters. 
first,  a  synthetic  waste,  was  used  during  initial  experimentation 
the  unavailability  of  actual  waste  samples.  A  water;  fuel:  AFFF  con-' 
centrate  ratio  of  1;  0.225;  0.005  was  selected  for  the  synthetic  wastey; 
This  ratio  was  felt  representative  of  the  relative  quantities  of  each^ 
constituent  used  during  an  FTF  exercise.  [28]  It  was  noted  that  during 
training  event  the  component  ratio  would  change  as  constituent- 
concentrations  are  reduced  through  combustion.  However,  it  was  felt' 
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that  using  these  volumes  would  produce  a  wastewater  more  difficult  to 
treat  and  would  therefore  be  conservative  from  a  contaminate  standpoint. 

Sample  development  consisted  of  combining  15  liters  of  distilled 
water,  3.375  liters  of  JP-4  aircraft  fuel,  and  0.075  liters  of  Ansul  3% 
AFFF  concentrate  in  a  24-liter  glass  carboy.  Using  a  magnetic  stir  bar 
and  base,  the  mixture  was  stirred  continuously  for  15  hours  to  promote 
hydrocarbon  transfer  to  the  water  phase.  After  mixing,  the  liquid  sat 
undisturbed  for  nine  hours  to  allow  for  segregation  of  free  fuel.  The 
synthetic  waste  was  siphoned  from  below  the  floating  fuel-AFFF  layer 
which  developed,  placed  in  2.5-liter  glass  bottles,  and  stored  in  the 
dark  at  4°C.  Samples  were  warmed  to  room  taiperature  immediately  prior 
to  testing  and  only  in  volumes  sufficient  for  the  work  at  hand. 

In  addition  to  the  synthetic  waste,  grab  samples  were  collected 
from  three  operational  FTFs.  One  came  from  Columbus  Air  Force  Base 
(CAFB),  Columbus,  Mississippi  while  two  were  obtained  from  different 
FTFs  located  at  Tyndall  Air  Force  Base  (TAFB) ,  Panama  City,  Florida. 
The  facilities  at  TAFB  consisted  of  an  older,  more  traditional  system 
''TAFB' 3  old  FTF)  and  a  new,  prototype  training  facility  (TAFB's  new 
FTF) .  Though  new  fire  training  facilities  will  use  many  of  the  design 
features  of  the  TAFB  prototype,  this  new  FTF  was  expected  to  generate 
somewhat  uncharacteristic  wastes  because  of  its  use  as  a  test  bed  for 
training  and  operational  procedures  and  experimental  chemicals. 

Samnles  were  obtained  from  lined  holding  ponds  that  follow 
oil/watei  separators  for  new  TAFB  FTF  and  CAFB's  facility.  The  sample 
from  TAFB's  older  facility  was  taken  from  the  last  stage  of  its 
oil/water  separator  in  that  no  holding  pond  exists.  In  all  cases. 
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samples  were  collected  about  four  weeks  after  a  training  exercise  and 
stored  in  gl^'’ss  carboys  in  the  dark  at  Only  volumes  of  ample 
sufficient  for  a  single  experiment  were  warmed  to  room  temperature 
immediately  before  testing. 

Wastewater  Characterization 

"everal  different  parameters  were  used  in  the  characterization  of 
the  wa-tewaters  tested  and  evaluation  of  process  efficiencies. 

Foam  Measurement  -  Determining  the  degree  of  foaminess  of  the 
waste  was  the  first  obstacle  overcome  in  the  design  cf  this  experiment. 
The  te  mique  allowed  comparative  evaluation  of  treatment  processes  and 
initial  waste  characterization.  For  characterization,  dilutions  were 
analyzed  to  determine  the  initial  strength  of  the  four  wastewaters. 
Each  dilution  was  developed  by  combining  distilled  water  with  the 
sample.  Dilution  of  the  sample  increased  until  reaching  a  concentration 
that  resulted  in  no  foam  production. 

Bikerman' s [lA]  technique  provided  the  basis  upon  which  the 
procedure  was  developed,  though  it  was  modified  for  use  in  measuring 
AFFF.  Bikerman  examined  ABS  (alkylbenzene  sulfonate)  which  is  very 
different  from  AFFF.  Through  experimentation,  it  was  determined  that 
AFFF  produces  more  foam  than  ABS  under  the  same  test  conditions.  The 
larger  volume  increased  the  sidewall  effects  for  Bikerman' s  prc''edure  to 
a  point  where  large  discontinuities  in  foam  consistency,  bubble  , ize  and 
and  foam  collapse  were  observed.  This  yield  in  highly  variable,  almost 
unreproducible  results  for  any  given  sample. 


29-19 


After  further  experimentation  with  glass  and  plastic  tubes  and 
chambers  of  differing  diameters  and  geometries,  a  four-liter  Nalgene 
graduated  cylinder  was  selected.  A  wheatstone  sparger  was  used  o 
disperse  the  air  as  it  was  added  to  the  bottom  of  the  sample.  A 
schematic  of  this  system  is  illustrated  in  Figure  1. 

Four  hundred  milliliters  of  the  sample  were  used.  The  sample  was 
carefully  metered  down  the  wall  of  the  graduated  cylinder  to  prevent  any 
foam  production.  After  the  sample  had  reached  quiescent  conditions,  it 
was  aerated  at  a  constant  rate  of  200  ml/min  for  A.O  minutes.  For  AFFF, 
the  200  ml/min  air  flow  rate  and  4-minute  time  interval  showed  excellent 
replicability  and  accuracy.  At  the  end  of  the  aeration  period,  the 
volume  of  foam  generated  was  measured  in  milliliters  and  recorded. 

Another  deviation  from  Bikerman's  technique  regards  the  use  of 
Sigma  as  a  means  of  expressing  the  foaminess.  By  establishing  a  con¬ 
stant  air  flow  rate  over  a  constant  time,  the  only  variable  in  the  Sigma 
relationship  is  that  of  the  foam  volume  generated.  Thus,  Sigma  did  not 
need  to  be  calculated.  Rather,  the  volume  of  foam  generated  was 
recorded.  It  is  felt  that  reporting  the  volume  of  foam  generated  makes 
it  easier  for  the  reader  to  perceive  foaminess. 

Bioinhibition  Studies  -  Even  if  the  aesthetic  problems  associated 
with  this  particular  waste  were  neglected,  the  waste  still  could  not  be 
treated  in  a  municipal  treatment  system  without  resolution  of  its  bioin- 
hibitory  nature.  To  quantify  the  toxic  threshold  of  the  waste,  thereby 
demonstrating  the  dilution  needed  for  discharge  to  a  POTW,  a  bioinhibi- 
t-ion  test  wa?  nin  on  raw  samples  from  each  source. 
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Figure  1 


Schematic  of  Foam  Testing  Apparatus 


4-liter 

Nalgene 

Graduated 

Cylinder 


To  determine  this  maximum  concentration,  the  Biochemical  Oxygen 
Demand  (BOD)  test  was  utilized. [2]  Different  volumes  of  each  wastes 
were  added  to  BOD  bottles.  A  sample  of  an  unacclimated  bacterial 
culture  was  placed  in  the  bottles.  This  bacterial  seed  was  developed  by 
collecting  sludge  from  a  local  wastewater  treatment  plant  and  feeding 
the  organisms  sodium  acetate  as  a  substrate.  Then,  the  bottles  were 
filled  with  dilution  water,  initial  dissolved  oxygen  measured,  and 
capped.  The  one-day  BOD  was  evaluated  and  results  evaluated  to  deter¬ 
mine  when  a  significant  reduction  in  bacterial  activity  occurred. 

Biodegradability  -  The  BOD  test  was  used  to  indicate  the 
biodegradability  of  the  treated  waste  samples.  The  general  equation  for 
the  exertion  of  oxygen  demand  during  this  test  is: 

BOD^  =  BODU  *  (1  - 

where  BOD^  equals  the  biochemical  oxygen  demand  observed  after  t  days  of 

incubation,  BODU  is  the  ultimate  (maximum)  BOD,  and  k,  is  the  base  e, 

d 

first-order  decay  rate  coefficient. 

Each  sample  was  analyzed  using  two  dilutions  set  up  in  triplicate. 
Dilution  water  samples  were  developed  using  two  seed  volumes  and 
evaluated  in  triplicate.  Samples  were  incubated  for  a  period  of  twenty- 
eight  days.  The  ultimate  BOD  (BODU)  and  the  decay  constant,  k^,  were 
determined  for  each  sample  using  a  non-linear,  least  squares  technique. 

An  acclimated  biological  culture  was  developed  for  each  of  the 
different  wastes.  Samples  of  a  fresh,  highly  diverse  population  of 
microorganisms  were  taken  from  the  grit  chamber  effluent  of  a  nearby 
POIV/.  The  samples  were  settled  and  supernate  diluted  with  distilled 


water.  Substrate  requirements  were  met  by  sodium  acetate  while  nutrient 
requirements  were  based  on  an  assumed  protoplasm  formulation  of 
^60^87*^23^12^'  Substrate  and  nutrients  were  provided  daily  and  the 
culture  was  continuously  aerated.  Acclimation  was  accomplished  slowly 
by  adding  increasing  volumes  of  wastewater  supplemented  with  nutrients. 
As  the  amount  of  wastewater  was  increased,  the  sodium  acetate  substrate 
was  reduced  until  eliminated.  By  the  end  of  two  weeks,  the  only  organic 
carbon  source  provided  was  the  particular  FTF  wastewater.  Solids 
generation  and  oxygen  uptake  illustrated  microorganism  viability. 

Organic  Content  -  Surfactants  are  organic  in  nature,  and  thus  the 
chemical  oxygen  demand  (COD)  test  was  used  to  indicate  the  total  oxidiz- 
able  carbon  in  the  wastewater.  The  technique  used  was  the  closed 
reflux,  titrimetric  method. [2] 

Other  Waste  Parameters  -  Other  measured  waste  characteristics 
included  pH,  alkalinity  and  total  suspended  solids  (TSS) .  All  were  per¬ 
formed  in  accordance  with  Standard  Methods. [2] 

Coagulation  Studies 

For  the  waste  at  hand,  removal  of  the  surfactants  would  all  but 
resolve  concerns  over  foaming  and  bioinhibition.  The  literature  indi¬ 
cates  that  surfactants  are  subject  to  the  same  destabilization  princi¬ 
ples  as  inorganic  sols  given  they  are  colloidal  emulsions.  Therefore, 
this  investigation  examined  coagulation  with  trivalent  aluminum  and 
trivalent  iron  for  destabilization  and  removal  of  AFFF  surfactants. 

The  experimental  procedure  was  basically  the  same  as  one  would  use 
for  an  investigation  of  the  coagulation  of  sols,  namely  the  "jar  test." 
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The  coagulants  chosen  were  aluminum  sulfate  (Al^ (SO^) ^ and  ferric 
chloride  (FeCl^).  When  pH  adjustment  was  required,  a  0.1  N  solution  of 
NaOH  or  a  0.1  N  solution  of  H^SO^  was  added.  A  500-ml  sample  size  was 
used  in  conjunction  with  a  Phipps-Byrd  six  paddle  laboratory  stirrer. 

Samples  were  rapid  mixed  on  a  magnetic  stirrer  for  2.5  minutes. 
To  provide  better  mixing  and  easier  measurement  of  pH,  the  magnetic 
stirrer  was  used  rather  than  the  paddle  stirrer.  Though  shorter  mixing 
times  are  common  for  water  treatment,  this  mixing  period  was  required  to 
provide  a  stable  pH.  After  mixing,  the  sample  was  immediately  placed  on 
the  paddle  stirrers  where  it  was  flocculated  for  15  minutes  at  30  rpm. 
Though  longer  times  are  typical  in  water  treatment,  the  15  minutes 
interval  was  chosen  to  offset  effects  of  the  longer  mixing  time.  Samples 
were  settled  for  30  minutes  after  flocculation.  Then  aliquots  of  the 
clarified  supernatant  were  extracted  for  characterization. 

Oxidation  Studies 

Two  oxidants  were  examined,  potassium  perm_nganate  and  hydrogen 

peroxide.  The  permanganate  was  fed  in  solution  form  as  was  the  ferrous 

2+ 

ammonium  sulfate  (FAS)  used  to  supply  Fe  catalyst  for  the  hydrogen 
peroxide  studies.  The  laboratory  procedure  consisted  of  a  series  of  jar 
tests  using  the  format  outlined  above.  The  pH  was  adjusted  immediately 
after  oxidant  addition  and  always  within  the  2.5  minutes  allotted  for 
rapid  mixing.  After  chemical  addition  and  mixing,  precipitant 
flocculation  and  settling  was  performed. 


RESULTS 


To  characterize  each  waste,  the  ultimate  BOD  and  first-order  decay 
constant,  the  foaming  potential,  the  COD,  the  bioinhibitory  level,  the 
total  suspended  solids,  the  pH,  the  dilution  to  eliminate  foaming,  and 
the  alkalinity  were  determined.  The  significance  of  each  was  discussed 
previously.  The  results  of  these  evaluations  are  tabulated  in  Table  1. 
A  brief  inspection  of  this  tabulation  clearly  indicates  that  the  four 
wastes  were  very  different.  This  adds  credibility  to  the  statement 
there  is  no  typical  waste  from  a  fire  training  facility.  Further,  com¬ 
parison  of  the  synthetic  waste's  characteristics  with  those  of  the  grab 
samples  from  actual  FTFs  indicates  a  marked  difference  in  properties. 
It  seems  clear  that  treatability  results  derived  from  the  synthetic 
waste  will  have  little  value  in  defining  the  treatability  of  actual 
wastes. 

The  synthetic  waste  was  found  to  have  i  tremendous  foaming  poten¬ 
tial.  Similarly,  the  dilution  required  to  remove  foaming  was  much 
greater.  These  facts  indicate  that  the  AFFF  concentration  of  this 
sample  is  much  greater  than  should  probably  be  expected  for  a  true  FTF 
wastewater.  The  BODU  and  COD  of  this  particular  sample  was  also  higher 
than  that  of  the  other  samples.  Yet,  the  decay  coefficient  is 
significantly  lower  indicating  a  poorer  biodegradability.  These  values 
can  also  be  attributed  to  a  higher-than-normal  AFFF  concentration. 
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Table  1 


Characterization  of  Wastewaters  from 
Firefighter  Training  Facility 


Wastewater  Sources  (*) 


Parameter 

(1) 

(2) 

(3) 

(4) 

Ultimate  BOD  (mg/1) 

138 

62 

295 

3170 

First-Order  Decay 

Coefficient,  (day  ) 

0.381 

0.533 

0.326 

0.150 

COD  (mg/1) 

171 

174 

480 

3790 

Total  Suspended  Solids  (mg/1) 

9 

35 

28 

0 

Initial  pH 

6.9 

7.1 

6.6 

6.1 

4 

Alkalinity  to  pH  =  5.10 
(mg/1  CaCO^) 

65 

172 

71 

4 

Bioinhibitory  Level 
(ml/liter) 

17 

200 

40 

33 

• 

Maximum  Foaming  Potential 
(ml  per  1000  mis) 

310 

1375 

375 

2375 

Dilution  to  Eliminate  Foaming 
(ml  diluted  to  1  liter) 

625 

125 

187 

8 

• 

*  -  (1)  TAFB  New  Firefighter  Training  Facility 

(2)  TAFB  Old  Firefighter  Training  Facility 

(3)  CAFB  Firefighter  Training  Facility 

(4)  Synthetic  Waste 
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For  comparison,  the  ultimate  BOD  of  AJFF  ranges  from  300,000  mg/1 
to  510,000  mg/1.  Its  ranges  from  500,000  mg/1  to  730,000  mg/1.  [27] 
Based  on  these  values,  th»»  mass  of  AFFF  used  in  synthetic  waste 
development  and  assuming  complete  AFFF  dissolution,  the  COD  should  range 
between  2,500  and  3,650  mg/1.  Similarly,  the  BODU  should  fall  in  a 
range  1,500  and  2,500  mg/1.  Noting  complete  dissolution  did  not  occur, 
one  can  assume  that  the  difference  between  the  actual  and  hypothetical 
BODU  and  COD  values  can  be  attributed  to  the  water-miscible  fuel  hydro¬ 
carbons  present  in  the  waste. 

It  was  hoped  that  a  correlation  could  be  developed  between  the 
bioinhibitory  levels  and  the  foaming  potential  of  each  waste.  After 
inspecting  the  data  presented  in  Table  1,  it  appears  there  is  no 
correlation  between  these  two  parameters.  This  may  be  due  to  different 
concentrations  of  water-miscible  fuel  hydrocarbons  present  in  the 
different  wastewaters.  Also,  the  presence  of  unidentified  components 
could  have  an  impact. 

The  low  alkalinity  of  the  synthetic  waste  can  be  attributed  to 
distilled  water  being  used  for  its  development.  Similarly,  the  water 
used  at  FTFs  would  be  the  primary  source  of  this  parameter.  The  pH  val¬ 
ues  are  consistent  with  alkalinity. 

The  total  suspended  solids  content  of  the  wastewaters  from  CAFB 
and  TAFB's  old  FTF  were  appreciable  while  the  synthetic  waste  and  the 
TAFB's  new  FTF  wastewater  were  almost  devoid  of  TSS.  Observations  at 
each  facility  confirmed  this  with  water  in  the  TAFB's  old  FTF's 
oil/water  separator  laden  with  bacteria  and  algae.  The  water  in  the 
pond  at  CAFB's  FTF  had  suspended  clay  particles  which  apparently 
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resulted  from  the  new  liner  placed  on  the  pit  area.  The  low  level  of 
solids  found  for  the  new  TAFB  facility  might  be  attributed  to  several 
factors.  One  would  be  that  the  oil/water  separator  has  about  four  times 
the  surface  area  of  CAFB's  FTF  and  is  a  newer  design  then  that  of  TAFB's 
old  FTF,  resulting  in  better  solids  separation.  Another  factor  might 
b.i  that  the  new  TAFB  FTF  had  been  used  for  only  a  few  months  prior  to 
sample  collection,  providing  insufficient  time  for  an  algae  population 
to  develop. 

This  brings  up  another  reason  for  the  lack  of  correlation  between 
foaming  and  bioinhibition.  If  the  surfactants  in  AFFF  were  to  adsorb  to 
the  sooty  particulate  matter  present  in  some  of  the  wastes,  its 
effectiveness  at  generating  foam  would  be  impaired.  At  the  same  time, 
microbial  organisms  would  be  attracted  to  these  same  sites,  thereby 
coming  nearer  the  surfactant  materials.  If  this  is  true,  the  foam  test 
alone  will  never  be  a  good  indication  of  the  surfactant  concentration 
present  in,  or  the  inhibitory  strength  of,  a  waste. 

Chemical  Coagulation 

Initial  evaluation  of  alum  coagulation  used  the  synthetic  waste- 
water.  Due  to  its  low  alkalinity,  some  pH  adjustment  to  achieve  optimum 
treatment  was  anticipated.  However,  to  obtain  an  initial  indication  of 
optimum  coagulant  dose,  the  first  test  was  performed  without  pH  modifi¬ 
cation.  This  test  found  that  foaming  potential  was  eliminated  at 

3  + 

dosages  above  AO  mg/1  as  A1  .  It  was  noted  that  at  this  and  greater 
alum  dosages  no  sludge  was  produced  though  the  wastewater  turned  milky 
white  due  to  pin  floe  development.  It  was  also  noted  that  the  pH  fell 
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below  5.0  units  at  or  above  40  mg/l  as  A1  which  is  outside  the  optimum 
insolubility  range  of  aluminum  hydroxide. 

Additional  testing  with  alum  was  performed  but  with  the  final  pH 
of  the  reaction  solution  being  adjusted  to  various  levels  with  0.1  N 
NaOH.  According  to  the  literature,  a  final  pH  of  5.7  units  would  ensure 
optimum  coagulation  and  better  sludge  production.  Table  2  summarizes 
results  from  the  tests  performed.  From  this  data  one  notes  that,  at  pH 
levels  above  5.0  units,  the  foaminess  of  the  wastewater  did  not  vary 
with  alum  dose.  During  these  experiments  sludge  production  was  never 
obsejTved  though  a  fine  pin  floe  did  consistently  form. 

From  this  one  can  conclude  that  coagulative-based  removal  of  the 
AFFF  surfactant  was  not  providing  the  foaminess  reduction  observed  pre¬ 
viously.  However,  the  values  in  Table  2  would  indicated  that  pH  adjust¬ 
ment  alone  was  insufficient  to  produce  this  effect.  This  was  confirmed 
by  additional  tests  with  sample  pH  being  adjusted  between  3.0  and  11.0 
units  and  no  coagulant  addition. 

Throughout  the  performance  of  these  coagulation  studies  the  floe 
generated  appeared  light  and  fluffy  with  poor  settling  characteristics. 
Referred  to  as  pin  floe,  it  can  result  from  long  mixing  times  if  energy 
is  added  to  the  point  of  destroying  developed  particles.  It  is  also  a 
good  indication  of  improper  pH. 

Coagulation  of  the  synthetic  waste  with  ferric  chloride  as  the 
coagulant  was  also  attempted.  Again,  the  foam  inhibition  effect 
observed  in  the  alum  studies  was  encountered.  In  particular,  dosages 
above  50  mg/l  as  Fe^^  cut  out  foaminess.  However,  foam  was  not 
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Table  2 


Results  for  Coagulation  with 
Aluminum  Sulfate  of  Synthetic  Waste 


Volume  of  Foam  Generated  (ml/1) 


Dosage  Alj^ 
(mg/1  A1  ^) 

Not 

Adjusted 

4.9 

5.7 

5.3 

0 

2375 

2375 

2375 

2375 

20 

2000 

1875 

23  75 

23  75 

25 

ic 

* 

2375 

2375 

30 

★ 

1750 

23  75 

23  75 

i 

35 

* 

* 

2375 

* 

40 

750 

450 

2375 

23  75 

45 

* 

* 

2375 

* 

50 

* 

0 

* 

2375 

60 

0 

0 

2375 

ic 

80 

0 

0 

* 

23  75 

« 

100 

0 

0 

2375 

* 

Value  not  reported  due  to  data  collection  error  or 
analysis  not  being  performed. 
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inhibited  until  the  pH  fell  below  4.4.  Further,  pin  floe  did  not 
develop  during  these  jar  tests. 

Because  sludge  development  did  not  occur,  it  was  anticipated  that 
the  organic  content  or  biodegradabilit^'  of  this  waste  would  not  change. 
The  results  of  the  COD  and  BOD  tests  confirmed  this.  The  data  from  the 
bioinhibitory  test  similarly  indicated  no  significant  change  in  this 
parameter. 

At  this  point  testing  of  the  samples  collected  from  the  three  FTFs 
began.  In  each  case,  the  wastewater  was  tested  initially  with  varying 
dosages  of  aluminum  sulfate,  without  pH  adjustment.  A  summary  of  these 
initial  tests  is  presented  as  Table  3.  From  inspection  of  this 
tabulation  one  can  see  that  the  pH-foaminess  relationship  existed  when 
alum  was  added.  Foam  production  was  nonexistent  when  pH  reached  the  5.0 
level.  Tests  involving  pH  adjustment  with  and  without  coagulant  were 
performed  on  each  sample.  The  results  confirmed  that  the  phenomena 
observed  previously  for  the  synthetic  waste  exists  for  the  three  FTF 
wastes  as  well.  However,  there  were  some  reaction  differences  which 
should  be  noted. 

When  the  CAFB  wastewater  sample  was  treated  with  alum  dosages  of 
between  40  and  80  mg/1  as  Al^^,  good  floe  formation  was  observed  and 
removal  of  solids  was  achieved.  Noting  the  increase  in  foaminess  which 
occurred  for  this  level  of  treatment,  it  would  appear  that  the  suspended 
solids  in  the  wastewater  broke  the  surface  tension  created  by  the 
surfactants.  This  would  greatly  inhibited  the  true  foaming  potential  of 
the  CAFB  waste  stream.  By  destabilization  and  removing  these  solids  the 
surface  tension  decreased,  allowing  an  associated  increase  in  foam 
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Table  3 


Foam  Generation  Results  from  the 
Aluminum  Sulfate  Coagulation  Studies 
of  Firefighter  Training  Facility  WastewaterS 


CAFE'S 

FTF 

TAFB's 

Old  FTF 

TAFB's 

New  FTF 

Dose 

Alum 

(mg/1  Al^  ) 

Foam 

Produced 

Final 

Foam 

Produced 

Final 

Foam 

Produced 

Final 

(ml/l) 

pH 

(ml/1) 

pH 

(ml/1) 

pH 

0 

375 

6.5 

1375 

7.0 

310 

6.9 

20 

438 

6.5 

* 

A 

* 

it 

40 

3875 

6.2 

★ 

ic 

* 

* 

50 

1000 

6.2 

★ 

★ 

3  75 

6.8 

60 

1375 

6.2 

it 

* 

* 

* 

80 

13  75 

6.0 

* 

ic 

* 

ic 

100 

1125 

5.9 

1500 

6.82 

438 

6.5 

125 

875 

5.9 

* 

* 

* 

* 

150 

375 

5.5 

* 

* 

125 

5.4 

160 

250 

5.4 

* 

* 

* 

* 

170 

125 

5.3 

* 

* 

* 

* 

175 

0 

5.2 

* 

■it 

* 

* 

200 

0 

4.9 

1500 

6.4 

0 

4.7 

300 

0 

4.6 

1500 

6.3 

* 

* 

350 

* 

* 

750 

6.1 

* 

* 

400 

0 

4.5 

500 

5.8 

* 

* 

415 

* 

* 

0 

5.1 

* 

* 

450 

* 

* 

0 

4.8 

* 

* 

*  -  Value  not 

reported  d 

ue  to 

data  collection  error 

or  analysis 

not 

performed. 


generation.  This  continues  until  the  pH  inhibiting  effect  begins  to 
take  over. 

With  regard  to  the  organics  present,  COD  was  reduced  by  about  25 
percent  at  a  dosage  of  dO  mg/l  as  Al^^.  There  was  no  decrease  in  the 
ultimate  BOD  and  the  decay  rate  remained  effectively  the  same.  Bioinhi¬ 
bition  characteristics  of  this  waste  similarly  remained  consistent.  It 
would  therefore  appear  that  the  solids  removed  are  chemically  oxidizable 
but  not  readily  degradable  by  microorganism. 

The  total  suspended  solius  concentration  of  the  wastewater 
generated  by  the  older  FTP  at  TAFB  was  similar  to  the  CAFB  waste.  It 
appears  that  there  was  sufficient  alkalinity  for  the  aluminum 
precipitation  reaction.  When  tested,  only  a  minimal  increase  in  the 
foam  production  was  realized  with  the  removal  of  the  suspended  matter. 
The  higher  initial  alkalinity  retarded  pH  depression  and  required  a 
large  arum  dosage  to  eliminate  the  foaming  potential.  Again,  this  point 
was  reached  at  a  pH  of  about  5  units. 

For  the  waste  from  the  old  FTF  at  TAI'B  ,  COD  was  reduced  by  about 

3+ 

25  percent  at  what  appeared  to  be  an  optimum  dose  of  200  mg/l  as  A1 
However,  the  BODU  of  this  sample  was  reduced  uy  fifty  percent  and  the 
decay  coefficient  was  reduced  to  0.36  days  This  can  be  attributed 

to  the  solids  contained  in  this  waste  being  microorganisms  rather  than 
the  clayish  material  found  in  the  CAFB  waste. 

The  wastewater  generated  at  the  new  training  facility  at  TAFB  was 

characteristically  different  from  the  others.  When  the  pH  was  reduced 

3t  .  . 

to  4.7  with  an  alum  dose  of  200  mg/l  as  A1  ,  a  light  non-settling  pin 
floe  developeu  similar  to  that  observed  during  the  tests  with  the 
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synthetic  waste.  Other  similar  results  included  foam  production 
stopping  with  BOD  and  COD  levels  effectively  unchanging  at  this  alum 
dose. 

A  method  that  may  be  applied  to  enhance  coagulation  is  the 
addition  of  a  weighting  agent.  The  coagulant  aid  chosen  was  kaoline,  an 
adsorptive  clay  which  is  readily  available  and  often  used  in  the 
production  of  potable  waters.  A  series  of  jar  tests  were  set  up  to  see 
if  addition  of  the  clay  would  produce  surfactant  removal.  Samples  of 
synthetic  waste  and  waste  from  TAFB's  new  FTF  were  used.  The  result  was 
that  a  very  good  floe  formed  which  provided  for  good  solids  separation. 
However,  foaminess  was  not  reduced  nor  was  the  COD  of  the  treated 
waters. 

Coagulation  of  the  FTF  wastewaters  with  ferric  chloride  proved 
fruitless.  As  with  the  synthetic  waste,  using  this  coagulant 
necessitated  lower  pH  levels  than  was  needed  with  alum  to  end  foaming. 
Solids  separation  results  were  similar  to  those  of  alum  with  good 
removal  for  the  wastewaters  from  the  CAFB  and  TAFB's  older  facilities. 

Chemical  Oxidation 

The  initial  ^valuation  of  hydrogen  peroxide  as  a  chemical  oxidant 
was  performed  on  the  synthetic  waste.  At  the  outset,  ^2^2  rang- 
ing  from  0.5  to  50  mg/1  were  examined  without  pH  adjustment  or  Fe  cat¬ 
alyst  addition.  These  tests  were  to  determine  if  a  threshold  or  an 
optimum  dose  existed  within  this  range.  Unfortunately,  a  measurable 
change  in  waste  characteristics  did  not  take  place  over  this  range  of 
H2O2  addition.  Initially  it  was  thought  that  greater  dosages  might  be 
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required  to  offset  an  unknown  competing  reaction.  However,  titration 
with  ferrous  ammonium  sulfate  indicated  that  an  oxidant  residual  existed 
at  the  lowest  dose  level  after  one  hour.  It  was  therefore  surmised  that 
oxidation  with  hydrogen  peroxide  was  not  possible  under  natural  condi¬ 
tions. 

The  literature  indicates  that  an  acidic  environment  is  beneficial 

for  peroxide  oxidation  of  alkanes.  To  test  if  pH  was  a  factor,  samples 

of  the  synthetic  waste  were  dosed  with  10  mg/1  of  ^^02  and  the  pH  was 

varied  from  3.0  and  11.0  units.  Once  again,  no  significant  change  in 

foaminess,  COD,  or  BODU  were  noted. 

The  last  test  of  hydrogen  peroxide  oxidation  of  the  synthetic 
2+ 

waste  examined  Fe  catalyst  addition.  A  dose  of  10  mg/1  of  ^^02  was 

added  to  several  samples.  A  pH  of  4.0  units  was  used  for  the  reaction, 

as  based  on  the  literature.  This  adjustment  was  made  immediately  after 

hydrogen  peroxide  addition.  Addition  of  FAS  solution  followed  pH 

2+ 

adjustment.  Fe  catalyst  concentrations  up  to  50  mg/1  were  examined. 

The  results  of  this  experiment  were  generally  unimpressive.  At 

catalyst  levels  above  35  mg/1,  final  waste  characteristics  did  not 

change  significantly.  It  was  noted  that  sedimentation  of  an  iron 

2+ 

precipitant  became  significant  for  the  samples.  Below  35  mg  Fe  /I  a 

2+ 

reduction  in  COD  was  observed.  A  catalyst  dose  of  12  mg  Fe  /I  yielded 
the  greatest  COD  removal.  For  this  dosage,  COD  was  reduced  by  24.2 
percent,  BOD  reduction  was  20.0  percent  and  the  decay  coefficient 
remained  about  the  same  at  0.16  day  However,  foaminess  was 

effectively  unchanged  over  the  entire  range  of  FAS  additions. 
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For  the  three  FTF  samples  collected  for  this  project,  somewhat 
limited  testing  of  hydrogen  peroxide  oxidation  was  performed.  A  solu¬ 
tion  pH  of  4.0  units  was  used.  A  catalyst  to  hydrogen  peroxide  ratio  of 
1.2  was  used  for  all  samples  and  was  based  on  the  optimum  value  for  the 
synthetic  waste.  The  tests  evaluated  dosages  ranging  from  1  to  10 
mg/1.  Once  again,  none  of  the  samples  showed  any  reduction  in  foam  gen¬ 
eration  after  treatment.  COD  removal  efficiencies  were  low.  For  these 
reasons,  BOD  was  not  performed.  These  results  would  seem  to  eliminate 
hydrogen  peroxide  from  consideration  as  an  effective  oxidizing  agent  for 
the  pretreatment  of  FTF  wastewaters. 

The  synthetic  wastewater  was  the  first  sample  examined  to  evaluate 
the  potential  usefulness  of  potassium  permanganate  in  pretreatment. 
These  studies  examined  the  addition  of  KMnO^  up  to  a  level  of  90  mg/1. 
The  data  showed  that  COD  removal  of  21.8  percent  was  achieved  at  the  40 
mg/1  KMnO^  level  and  that  doses  above  this  did  not  significantly  reduce 
the  level  of  this  parameter  further.  Below  the  40  mg/1  KMnO^  level,  the 
BOD  and  the  decay  coefficient  were  reduced.  Above  this,  residual  KMnO^ 
inhibited  bacterial  activity. 

It  therefore  seems  that  the  potassium  permanganate  reacted  with 
some  of  oxidizable  organics  which  were  also  biodegradable.  However, 
once  these  materials  were  stabilized,  it  appears  that  the  reaction 
kinetics  changed,  slowing  the  rate  of  oxidation  significantly.  Further, 
foam  generation  for  all  of  the  samples  was  unchanged.  It  would 
therefore  appear  that  the  surfactants  were  not  involved  in  the  more 


rapid  oxidation  reaction. 


The  CAFB  sample  was  exposed  to  a  similar  set  of  tests  though  there 
was  a  significant  difference  in  results.  The  data  from  these  experi¬ 
ments  are  presented  Table  4.  In  short,  potassium  permanganate  doses  of 
20  mg/1  and  greater  resulting  in  a  reduced  BODU  concentration.  For  a 
measurable  COD  reduction,  30  mg/1  was  required.  It  is  also  interesting 
to  note  an  increase  in  apparent  biodegradability,  as  indicated  by  the 
decay  coefficient,  at  the  10  mg/1  dose  level.  No  foam  was  developed 
from  any  of  the  treated  samples. 


Table  4 


Potassium 

Permanganate 

Oxidation 

of  CAFB  Wastewater 

Chemical  Dose 
(mg/1  KMnO^) 

Ultimate 

COD  BOD 

(mg/1)  (mg/l) 

Decay 

Coefficient 
(day  ) 

Foam 

Produced 

(ml/1) 

0 

537 

313 

0.316 

0 

10 

531 

310 

0.377 

0 

20 

535 

268 

0.255 

0 

30 

445 

122 

0.158 

0 

40 

484 

129 

0.131 

0 

This 

is  the 

desired  result 

from 

pretreatment 

of  this  waste  before 

biological 

treatment  in  a  POTW. 

However, 

more  than  oxidant  addition 

must 

have 

played 

a  role.  At 

the 

10 

mg/l 

level  the  dose/COD  ratio  for 

this 

waste 

was  0 . 

019  mg/mg. 

At 

the 

same 

time  the 

S3mthetic  waste  had 

been 

exposed  to 

a  0,024  mg 

dose/mg 

COD 

and  had 

not  produced  these 

results. 
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Reevaluation  of  the  data  indicated  that  the  final  pH  of  the  syn¬ 
thetic  waste  samples  ranged  between  6.8  for  the  lowest  dose  to  7.7  at 
the  highest  dose.  The  CAFE  samples  had  final  pHs  of  about  6.6.  Prior 
to  this  discovery,  pH  adjustment  was  not  considered  because  of  the 
desirably  neutral  level  of  the  raw  waters  and  the  intent  to  only  pre¬ 
treat.  However,  a  wastewater  having  a  pH  of  6.5  would  be  acceptable  for 
discharge  into  a  POTO  under  current  USEPA  guidelines  and  the  literature 
does  indicate  that  acidic  conditons  enhance  the  permanganate  oxidation 

process.  Therefore,  the  synthetic  waste  was  retested  using  a  final  pH 

4 

of  6.5.  Under  these  conditions,  a  level  of.  KMnO  addition  was  observed 
which  eliminated  the  foaming  potential  of  the  synthetic  waste. 

It  appears  from  these  experiments  that  a  40  mg/1  of  KMnO^  is 
sufficient  to  provide  for  the  destruction  of  the  foam  generation 
potential  of  the  synthetic  wastewater.  It  should  be  noted  that  at  the 
permanganate  doasge  which  abated  foam  generation,  COD  had  been  reduced 
only  by  about  six  percent  and  that  there  was  virtually  no  removal  of 
BOD.  Further,  the  decay  coefficient  had  increased  only  slightly  at  this 
treatment  level.  Table  5  contains  the  values  of  these  parameters  at  the 
dose  of  KMnO^  which  eliminated  foaminess.  The  values  for  foam  genera¬ 
tion  potential  as  determined  during  these  experiments  is  presented  in  as 
Figure  2. 

Given  these  results,  the  samples  collected  from  the  two  TAFB 
facilities  and  from  CAFB  were  evaluated  for  potassium  permanganate  oxi¬ 
dation  at  a  pH  )£  6.5  units.  Figures  3  through  5  illustrate  the  test 
results  on  foaming.  Using  these  data,  optimum  dosages  have  been  defined 
and  are  summarized  in  Table  5.  The  waste  characterization  for  these 
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Table  5 


Results  of  Potassium  Permanganate  Addition  to  Wastewaters 
from  Firefighter  Training  Facilities  (Final  pH  =  6.5) 

Wastewater  Sources  (*) 


Parameter 

(1) 

(2) 

(3) 

(4) 

KMnO,  Dose  Required  to 

Eliminated  Foaming  (mg/1) 

4 

10 

5 

40 

COD  (mg/1);  Initial 

164 

170 

465 

3660 

Final 

158 

156 

455 

3440 

Ultimate  BOD  (mg/1) ;  Initial 

133 

53 

270 

3090 

Final 

127 

58 

250 

3060 

Decay  Coefficient,  k, (day  ^); 

Initial 

0.38 

0.53 

0.33 

0.15 

Final 

0.39 

0.48 

0.32 

0.15 

*  -  (1)  TAFB  New  Firefighter  Training  Facility 

(2)  TAFB  Old  Firefighter  Training  Facility 

(3)  CAFB  Firefighter  Training  Facility 

(4)  Synthetic  Waste 
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Foam  Conoratod  por  400— ml  Sampio  (mis)  Foam  Ganorotod  par  400-ml  Sample  (mis) 

(Thousands) 


Figure  2 

Foam  Produoad  Varsuo  Parmengonata  Doaa 


Potaaatum  Parmanganata  Doaaga  (mg/I) 


Figure  3 

Feam  Produoad  Varouo  Parmanganata  Doaa 
CAFB's  FTP'S  Wastawatar.  Hnal  pH  m  g.s 


Potaaslum  Parmanganata  Doaaga  (mg/I) 
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Foam  C«n«rat*d  p«r  400— ml  Sampla  (ml*)  Foam  C*n*rat*d  p*r  400— ml  SompI*  (ml*) 
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levels  of  potassium  permanganate  addition  is  also  presented  in  this 
tabulation. 

From  these  experiments,  it  has  been  found  that  potassium  perman¬ 
ganate  is  effec  ive  at  destroying  the  foaming  potential  of  FTF  waste- 
waters  if  the  pH  of  the  solution  is  6.5  units.  At  the  same  time,  there 
does  not  seem  to  be  a  significant  shift  in  organic  content  (COD)  or 
biodegr  idability  (Ultimate  BOD  and  decay  rate). 
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CONCLUSIONS  AND  RECOMMENDATIONS 


The  experiments  presented  in  this  report  examined  the  possibility 
of  reducing  the  foaming  potential  firefighter  training  facility 
wastewaters.  The  improvement  in  its  biodegradability  was  considered  a 
desirable  side  effect. 

Potassium  permanganate  addition  coupled  with  pH  control  has  been 
found  successful  at  reducing  the  foaming  potential  of  wastewaters 
generated  by  FTFs.  No  significant  sludge  was  produced,  foaming  was 
permanently  eliminated,  and  biodegradability  may  actually  be  enhanced. 
The  variability  of  the  characteristics  of  these  wastes  impacts 
pretreatment  needs  and  chemical  requirements.  Therefore,  studies  will 
have  to  be  performed  on  a  case-by-case  basis  before  wastewater  discharge 
to  a  POTW. 

Coagulation  for  the  removal  of  AFFF  surfactants  was  found  to  have 
little  if  any  application  in  pretreating  these  wastewaters.  A  foam 
inhibition  effect  was  found  early  in  the  experimental  process.  This 
occurred  when  pH  of  the  solution  fell  into  the  range  of  4.8  to  5.1  upon 
the  addition  of  alum.  However,  adjusting  the  pH  out  of  this  range,  as 
would  be  required  before  waste  discharge  to  a  POIV,  resulted  in  the 
foaming  potential  of  the  waste  being  reinstated. 

The  samples  collected  “^rom  the  CAFE  and  older  TAFB  facilities 
contained  solids  which  were  readily  coagulated,  flocculated  and  removed 
through  sedimentation.  This  validated  the  mixing,  flocculation,  and 
sedimentation  procedures  used  during  this  investigation.  During 
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coagulant  addition,  foaming  potential  of  these  samples  increased  to  a 
point  of  inflection  after  which  the  foam  production  decreased  to  zero. 
It  appears  that  the  suspended  solids  in  e  water  broke  the  surface 
tension  of  the  bubble  created  by  the  AFFF  surfactants.  With  removal  of 
this  matter,  the  surface  tension  and  foam  production  increased.  The 
inflection  point  of  the  curve  occurs  when  the  pH  reaches  the  level  for 
foaming  inhibition. 

The  synthetic  sample  and  the  wastewater  from  TAFB's  new  FTF 
contained  no  significant  solids  and  coagulation  was  unable  to  produce  a 
floe  particle  that  would  settle.  Kaoline  was  added  as  a  weighting  agent 
and  particle  sedimentation  did  occur.  This  addition  did  not  improve  the 
post-treatment  characteristics  of  the  wastewaters.  It  was  observed  that 
with  the  kaoline  in  suspension  the  foaminess  of  the  waste  was  reduced. 
However,  once  solids  separation  had  been  achieved,  the  foaming  potential 
returned  to  its  original  level. 

The  BOD  data  for  the  samples  treated  with  alum  and  ferric  chloride 
showed  no  increase  in  the  biodegradability  of  the  wastes  after 
treatment.  This  would  be  expected  given  that  no  solids  of  significance 
were  removed. 

Studies  identified  the  level  of  waste  dilution  at  which  foaming 
potential  was  eliminated.  This  data  appeared  to  be  first-order  in 
function  and  an  effort  to  regress  an  equation  relating  initial  foaming 
potential  to  dilution  was  made.  Unfortunately,  the  confidence  contours 
were  so  large  that  no  distinction  could  be  made  between  the  functions. 
This  can  be  attributed  to  a  limited  data  base. 
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It  is  known  that  the  biological  inhibition  is  dependent  on  the 
surfactant  concentration  present  in  the  wastes.  The  level  of  wastewater 
dilution  needed  to  eliminate  foam  production  should  have  correlated  with 
the  dilution  below  which  bioactivity  was  unimpeded.  The  data  did  not 
correlate  in  this  manner.  This  can  be  attributed,  in  part,  to  the  fact 
that  foam  generation  is  sensitive  to  suspended  solids  for  any  given 
concentration  of  AFFF. 

Based  on  the  results  of  this  investigation,  several  questions  seem 
to  require  further  study.  Some  of  these  are  listed  below. 

1.  What  is  the  affect  pH  has  on  permanganate  oxidation  of  this 
waste? 

2.  What  impact,  permanent  or  temporary,  do  inert  and  biological 
solids  have  on  the  foaming  potential  of  FTF  wastes? 

3.  Can  peroxide  oxidation  be  enhance  with  UV  or  will  the  spectral 
ban  of  the  surfactants  impact  this  process? 

4.  Can  a  relationship  between  solids,  foaming  potential,  and/or 
dilution  required  to  eliminated  foaming  be  developed  which 
predicts  surfactant  concentration?  Predicts  bioinhibitory 
concentration? 
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ABSTRACT 

The  research  described  in  this  report  was  conducted  to  examine  the  relationship 
between  compaction  moisture  content,  dynamic  stress  transmission  and  soil  microstructure 
in  compacted  unsaturated  Ottawa  20-30  sand  Recent  research  has  shown  that  moisture 
conditions  during  compaction  can  increase  the  stress  transmission  ratio  measured  in 
dynamic  impact  tests  by  as  much  as  a  factor  of  two  and  can  also  lead  to  increased  stress 
wave  propagation  velocities.  Other  studies  have  demonstrated  that  both  the  method  of 
compaction  and  the  amount  of  moisture  present  during  compaction  have  a  measurable 
influence  on  both  the  static  and  dynamic  properties  of  sands.  Experimental  evidence 
suggests  such  behavior  can  be  attributed  to  variations  in  soil  microstructure  and 
compressibility  as  a  result  of  conditions  during  compaction.  However,  a  clear  and  concise 
explanation  of  the  phenomenon  and  the  interrelationship  among  various  parameters  is  not 
currently  available. 

Split-Hopkinson  Pressure  (SHPB)  tests  were  conducted  on  compacted  moist 
specimens  of  Ottawa  20-30  sand  tested  moist  and  after  oven  drying.  The  SHPB  results 
indicate  that  the  moisture  present  during  compaction  has  a  significant  effect  on  stress 
transmission  even  for  specimens  compacted  moist  and  tested  dry.  These  results  also 
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suggest  that  moisture  is  an  important  factor  contributing  to  the  development  of  soil 
microstructure  and  soil  stiffness  during  compaction.  In  addition,  compacted  specimens 
were  epoxied  and  sectioned  for  microstructural  analysis  using  an  approach  based  on 
standard  petrographic  procedures  available  in  the  open  literature.  The  preliminary  results 
obtained  to  date  indicate  that  some  preferred  particle  orientations  do  exist  with  variations  in 
moisture  conditions  at  compaction  which  could  affect  soil  behavior.  However,  no  strong 
preferential  orientation  has  been  observed  in  the  data  so  far.  Further  detailed 
microstructural  studies  are  currently  being  conducted  and  will  be  the  subject  of  future 
reports  and  publications  by  the  authors. 
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I.  INTRODUCTION 


The  prediction  of  ground  motions  from  explosive  detonations  and  their  effects  on 
structures  requires  information  about  the  response  of  geologic  materials  to  intense  transient 
loadings.  Both  laboratory  and  field  investigations  have  provided  insight  into  the  stress 
wave  propagation  characteristics  of  soils.  Of  particular  interest  is  the  ability  of  a  soil  to 
transmit  applied  dynamic  stresses.  Since  soil  is  a  multiphase  media,  in  the  general  case 
(eg.  solids,  water  and  air),  its  static  and  dynamic  behavior  is  very  complex.  The  general 
nature  of  stress  wave  propagation  in  particulate  media  such  as  soils  depends  on  a  number 
of  parameters,  the  interrelationship  between  which  is  not  fully  understood.  Soil 
microstructure  is  affected  by  the  degree  of  saturation  (moisture  condition),  soil  density, 
effective  stress,  applied  stress  intensity,  stress  history  and  the  nature  of  the  material  itself 
(e.g.  particle  size,  shape,  distribution,  mineralogical  constituent(s),  etc.). 

The  ability  of  a  soil  to  transmit  applied  dynamic  stresses  (energy)  is  of  particular 
interest  to  the  U.S.  Air  Force  with  respect  to  military  protective  construction  and 
survivability  designs.  Typical  engineering  analyses  assume  that  little  or  no  material 
property  changes  occur  under  dynamic  loadings,  and  in  addition,  analyses  do  not  account 
for  the  effects  of  saturation  (moisture  conditions)  on  energy  transmission  in  soils.  This  is 
primarily  due  to  an  incomplete  understanding  of  the  behavior  of  soils  under  transient 
loadings  and  uncertainties  about  field  boundary  conditions. 

Current  analysis  and  prediction  methods  generally  use  material  properties  data 
based  on  conventional  weapons  detonations  in  dry,  or  to  a  limited  extent,  saturated  soils. 
Results  from  U.S  Air  Force  field  and  laboratory  tests  with  explosive  detonations  in  soils 
have  shown  that  material  property  changes  do  in  fact  occur  and  that  variations  in  soil 
stiffness  (or  compressibility)  significantly  affect  both  dynamic  and  static  stress 
transmission.  However,  there  are  currently  no  theoretical,  empirical  or  numerical  methods 
available  for  predicting  large  amplitude  compressive  stress  wave  velocity  and  stress 
transmission  in  unsaturated  soils  (Crawford  et  al.,  1974;  Ross  et  al.,  1986;  WES,  1984). 
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Dynamic  stress  transmission  has  been  shown  to  be  dependent  upon  the  moisture 
content  and  boundary  conditions  present  daring  compaction  (Ross  et  al.  1986;  Ross  1989; 
Veyera  1989).  It  is  believed  that  the  observed  behavior  can  be  attributed  to  variations  in 
soil  microstructure  developed  as  a  consequence  of  moisture  present  during  compaction. 
Considering  this,  the  research  outlined  herein  was  performed  as  a  part  of  the  1990 
Research  Initiation  Program  (RIP)  to  obtain  information  on  the  development  of 
microstructure  in  compacted  unsaturated  sand  and  its  relationship  to  stress  transmission 
behavior.  A  series  of  tests  were  conducted  to  assess  the  stress  transmission  characteristics 
of  compacted  moist  Ottawa  20-30  sand.  In  addition,  a  detailed  examination  of  the 
microstructure  formed  during  compaction  was  initiated.  Specific  microstructural 
parameters  of  interest  include  particle  orientations,  void  space  orientation,  and  percentage 
void  space  as  a  function  of  moisture  content  (saturation)  and  confinement  conditions  and  in 
particular,  the  spatial  variations  occurring  on  vertical  and  horizontal  planes.  Procedures 
available  in  the  open  literature  for  studying  planar  pore  and  grain  patterns  have  been 
adopted  as  a  basis  for  analyzing  the  data  obtained. 

The  results  of  studies  such  as  this  will  lead  to  a  better  fundamental  understanding  of 
the  role  of  microstructure  as  it  affects  the  macroscopic  engineering  behavior  of  soils.  The 
microstructural  characterization  of  unsaturated  soils  will  be  a  key  element  in  establishing 
and  developing  an  understanding  of  stress  transmission  in  unsaturated  soils  and  will  have 
direct  applications  to  groundshock  prediction  techniques  including  stress  transmission  to 
structures. 
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II.  RESEARCH  OBJECTIVES 


The  study  described  herein  was  conducted  to  examine  the  development  of 
microstructure  in  compacted  moist  sand  and  its  influence  on  dynamic  stress  transmission. 
The  primary  objectives  of  the  RIP  research  study  were  as  follows: 

1 .  To  determine  the  influence  of  moisture  content  during  compaction  on  soil 
microstructure  and  soil  properties  (compressibility,  wave  speed,  transmission  ratio, 
strength);  and 

2.  To  investigate  and  qualitatively  describe  the  development  of  microstructure  in 
compacted  unsaturated  sand  and  its  effect  on  stress  transmission  from  conventional 
weapons  effects. 

III.  BACKGROUND 

A.  Stress  Transmission  in  Unsaturated  Soils 

Recent  experimental  investigations  by  Ross  et  al.  (1986)  and  Ross  (1989)  have 
demonstrated  that  compacting  moist  sands  at  varying  degrees  of  saturation  prior  to  dynamic 
testing  can  increase  the  stress  transmission  ratio  by  as  much  as  a  factor  of  two  and  can  also 
lead  to  increased  stress  wave  propagation  velocities.  Additional  research  by  Charlie  and 
Pierce  (1988)  to  study  the  influences  of  capillary  stresses  on  the  behavior  of  the  same  soils 
tested  by  Ross  further  confirmed  and  extended  those  findings.  They  also  demonstrated  that 
the  influence  of  capillary  pressures  on  the  stiffness  of  sands  is  negligible,  being  on  the 
order  of  about  7  kPa  (1  psi)  or  less  which  is  insignificant  in  comparison  with  high  intensity 
transient  dynamic  loadings.  Therefore,  it  appears  that  capillary  pressures  do  not  directly 
affect  a  granular  soil's  ability  to  transmit  stresses.  However,  studies  performed  by  Ross  et 
al.  (1986),  Ross  (1989),  Veyera  (1989)  and  as  a  part  of  this  investigation  suggest  that 
capillary  pressures  may  strongly  influence  the  soil  microstructure  developed  during 
compaction  (including  soil  placement  and  soil  formation  in  the  field)  which  could 
significantly  affect  both  the  static  and  dynamic  behavior  of  soil.  Recent  studies  by  Charlie 
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and  Walsh  (1990)  were  conducted  to  determine  the  response  of  unsaturated  soils  to  small 
scale  explosive  detonations  in  the  centrifuge.  Their  results  have  also  shown  stress 
transmission  to  depend  on  compaction  moisture  conditions. 

Studies  by  a  number  of  other  investigators  have  also  shown  that  the  compaction  of 
sands  with  moisture  present  has  a  measurable  influence  on  both  static  and  dynamic  soil 
properties  v>hich  can  be  attributed  to  variations  in  soil  stiffness  and  microstructure. 
Mitchell  et  al.  (1976)  studied  the  effects  of  sand  fabric  and  sample  preparation  method  on 
the  liquefaction  behavior  of  sands.  They  performed  cyclic  triaxial  tests  and  observed 
significant  differences  in  behavior  based  on  how  specimens  were  prepared.  Mulillis  et  al. 
(1977)  investigated  11  different  packing  methods  and  showed  that  the  method  of 
compaction,  particularly  the  initial  moisture  conditions,  strongly  influences  the  cyclic 
liquefaction  behavior  of  fine  sands.  Wu  et  al.  (1984)  performed  resonant  column  tests  on 
fine  sands  and  observed  that  capillary  pressures  in  specimens  compacted  moist  at 
saturations  in  the  range  of  from  5%  to  20%  produced  a  significant  increase  in  the  dynamic 
shearing  modulus. 

An  interesting  study  to  investigate  the  effects  of  saturation  on  wave  velocity  in 
sandstones  was  reported  by  Hughes  and  Kelly  (1952)  who  directly  measured  variations  in 
dilatational  velocity  with  saturation  on  rock  core  specimens  using  pulse  techniques.  As  a 
part  of  the  study,  both  temperature  and  confining  pressure  were  varied.  Results  indicated 
that  wave  velocity  increased  between  0%  and  about  20%  saturation,  remained  essentially 
constant  between  about  20%  and  90%  saturation,  and  then  decreased  thereafter  for 
pressures  between  105  and  526  kPa  (725  and  3626  psi)  at  room  temperature  (see  Figure 
1).  At  higher  pressures,  the  wave  velocity  was  fairly  constant  up  to  about  90%  saturation. 
Wyllie  at  al.  (1956)  performed  tests  similar  to  those  of  Hughes  and  Kelly  (1952)  on  Berea 
sandstone  and  further  substantiated  the  previous  findings.  These  results  are  remarkably 
similar  to  those  obtained  by  Ross  et  al.  (1986),  Ross  (1989),  Veyera  (1989)  and  the  study 
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Normalized  Dilatational  Wave  Velocity 


Core  No.  188,  Sample  No.  46  -  Silty  Shale 


Figure  1.  Normalized  Dilatational  Wave  Propagation  Velocity  as  a  Function 
of  Saturation  and  Confining  Pressure  (Hughes  and  Kelly,  1952). 
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presented  in  this  report  for  SHPB  tests  performed  on  uncemented  sands  compacted  at 
different  saturations  and  a  constant  dry  density. 

B.  Microstructural  Analysis 

Various  studies  have  been  conducted  in  an  attempt  to  characterize  and  relate  soil 
structure  at  the  microscopic  level  to  the  engineering  properties  of  soils  (macroscopic  level). 
These  investigations  have  resulted  in  the  development  of  techniques  for  microstructural 
analysis  of  soil.  The  procedures  generally  involve  thin  sectioning  of  epoxied  specimens 
and  the  characterization  of  directional  grain  and  pore  space  orientations  by  statistical 
analysis.  Then  a  correlation  with  engineering  behavior  can  be  extrapolated  from  the 
results.  Standard  petrographic  analysis  procedures,  commonly  used  in  geologic  studies, 
are  available  in  the  open  literature  (Brewer,  1962;  Turner  and  Weiss,  1963).  These 
techniques  have  been  modified  for  use  in  geotechnical  engineering  analyses  of  soil 
microstructure  (Campbell,  1985;  LaFeber  1965,  1972;  Oda  1972a,  1972b)  and  have  been 
adopted  for  use  in  this  study.  A  few  particular  approaches  and  observations  are  presented 
in  this  section. 

LaFeber  (1965)  developed  a  method  for  analyzing  soil  microstructure  in  terms  of 
the  spatial  orientation  of  planar  pore  patterns.  He  indicates  their  significance  by  stating  that 
these  patterns  "...can  be  expected  to  be  an  expression  of  the  depositional  and/or  stress- 
strain  history  of  the  particular  soils.  As  such,  they  should  be  of  paramount  importance  in 
the  study  of  the  mutual  relations  between  soil  fabric  and  soil  mechanical  behavior." 
LaFeber  (1972)  further  extended  his  work  to  the  analysis  of  three-dimensional  grain 
orientations  and  anisotropic  fabrics  by  studying  compound  linear  and  planar  fabric  patterns 
and  demonstrated  the  influence  on  soil  fabric  on  the  mechanical  properties  of  several  soils. 
Oda  (1972a,  1972b)  noted  that  particle  arrangements  in  sands  are  determined  by  both  the 
particle  shape  and  the  method  of  compaction  and  that  initial  fabric  is  important  to 
mechanical  properties.  He  also  developed  a  method  for  characterizing  fabric  featxues  in 


terms  of  a  projected  area  ratio  which  can  be  used  to  estimate  mechanical  properties  of 
sands. 

Mahmood  and  Mitchell  (1974)  studied  particle  orientation  as  a  function  of  three 
compaction  methods:  pouring,  dynamic  compaction,  and  static  compaction.  They  found 
that  a  nearly  random  particle  orientation  was  produced  by  dynamic  compaction.  The  static 
technique  resulted  in  grain  orientations  that  were  nearly  45  degrees  from  a  horizontal  plane 
while  the  pouring  method  produced  a  strong  preferred  horizontal  orientation  of  particles. 
Mitchell  et  al.  (1976)  studied  the  effects  of  sand  fabric  and  sample  preparation  method  on 
the  liquefaction  behavior  of  sand  and  observed  differences  in  dynamic  behavior  based  on 
the  specimen  preparation  method  used.  Microstructural  analyses  of  specimens  showed 
measurable  differences  in  grain  orientations  and  resultant  fabrics  from  the  different 
compaction  methods.  In  addition,  Mitchell  et  al.  (1978)  investigated  the  fabric  of 
undisturbed  sands  from  the  city  of  Niigata  Japan,  the  site  of  a  major  earthquake  in  1964, 
and  also  observed  the  influence  of  microstructure  on  dynamic  soil  behavior.. 

Using  mercury  intrusion  porosimetry  techniques,  Juang  and  Holtz  (1986)  were  able 
to  characterize  the  effects  of  compaction  and  compaction  moisture  content  by  a  Pore  Size 
Distribution  (PSD)  index.  They  observed  distinct  pore  size  distributions  for  the  different 
packing  methods  investigated.  Nimmo  and  Akstin  (1988)  found  that  the  permeability  of 
sandy  soils  was  highly  dependent  upon  the  compaction  method,  especially  moist 
compaction.  They  attributed  variations  in  permeability  to  preferred  grain  orientations 
during  compaction  as  a  result  of  moisture  being  present  as  opposed  to  dry  packing  which 
should  typically  give  a  random  orientation. 

These  various  studies  have  recognized  and  described  the  influence  of  soil 
microstructure  on  soil  behavior.  However,  a  clear  and  concise  explanation  of  the 
phenomena  observed,  especially  with  respect  to  dynamic  behavior,  is  not  currently 
available  and  considerable  research  remains  to  be  done  in  this  area. 
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IV. 


The  investigation  described  herein  was  conducted  to  examine  the  stress 
transmission  behavior  and  development  of  microstructure  in  dynamically  compacted 
specimens  of  unsaturated  sand.  The  SHPB  device  at  RDCM  was  used  to  obtain  data  on 
dynamic  stress  transmission  characteristics.  Compacted  specimens  were  also  epoxied  and 
sectioned  for  microstnictural  analysis  to  study  the  orientation  of  particles  as  a  function  of 
compaction  moisture  (saturation).  These  aspects  of  the  study  are  outlined  in  the  following 
sections. 

A.  Description  of  Granular  Material  Tested 

A  commercially  available  granular  soil  designated  as  "Ottawa  20-30  sand" 
was  used  in  all  tests.  Approximately  227  kg  (500  lbs)  of  the  sand  was  obtained  from  the 
Ottawa  Silica  Company  and  random  samples  were  taken  from  the  bulk  quantity  for  this 
investigation.  The  material  is  a  uniformly  graded,  subrounded  to  rounded,  medium  sand 
with  no  fines  and  is  classified  as  SP  according  to  the  Unified  Soil  Classification  System 
(USCS).  Physical  index  properties  for  the  Ottawa  20-30  sand  are  summarized  in  Table  1 
and  a  grain  size  distribution  curve  is  shown  in  Figure  2. 

B.  Dynamic  Compaction  of  Specimens 

For  this  study,  Ottawa  20-30  sand  specimens  were  dynamically  compacted  in  a 
thick-walled  stainless  steel  tube  to  a  constant  dry  density  of  1.715  g/cm^  (107.0  pcf)  at 
varying  degrees  of  saturation  (different  initial  moisture  contents).  A  thick-walled  tube  was 
used  to  simulate  the  one-dimensional  boundary  condition  encountered  in  the  field  near  an 
explosion.  The  tube  was  7.62  cm  (6.00  inches)  long  with  an  inside  diameter  of  5.08  cm 
(2.00  inches)  and  a  wall  thickness  of  2.54  cm  (1.00  inches).  The  compaction  process  used 
a  Standard  Proctor  hammer,  ASTM  D-698  (ASTM  1990),  to  consistently  apply  a 
controlled  amount  of  compactive  effort  per  impact  to  each  soil  specimen  (7.5  Joules  or  5.5 
ft-lbs  per  impact).  Specimens  were  formed  using  four  individually  compacted  layers  of 
equal  weight  such  that  a  final  total  specimen  length  of  10.16  cm  (4.00  inches)  was 
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obtained.  The  final  compacted  thickness  of  each  lift  was  2.54  cm  (1  inch)  and  the  degree 
of  saturation  was  varied  from  near  0%  (dry)  to  about  75%  .  Figure  3  shows  a  schematic  of 
a  typical  specimen  being  compacted. 

TABLE  1.  Physical  Properties  of  Ottawa  No.  20/30  Sand. 


uses  Classification 

SP 

Specific  Gravity 

2.65 

D50  particle  size 

0.70  mm 

^u 

1.40 

bQ 

1.03 

^Percent  passing  #100  sieve 

<1  % 

^Maximum  dry  density 
‘^Minimum  dry  density 

1,763  kg/m3 

1,587  kg/m3 

Maximum  void  ratio 

Minimum  void  ratio 

0.669 

0.504 

Note:  Coefficient  of  U niformity 
bCoefficient  of  Curvature 

C.S.  Standard  sieve  size 
^Data  from  Ottawa  Silica  Sand  Co. 

At  saturations  near  0%,  each  layer  of  dry  soil  was  poured  directly  into  the  tube  and 
then  compacted.  In  preparing  moist  specimens,  the  required  amount  of  water  for  a  given 
degree  of  saturation  (at  final  compacted  density)  was  added  to  the  originally  dry  soil, 
thoroughly  mixed  in  and  then  allowed  to  equilibrate  before  compacting.  The  amount  of 
compactive  effort  required  tc  achieve  the  desired  dry  density  was  observed  lO  vary  with  the 
amount  of  moisture  present  (saturation).  Since  the  specimens  ranged  in  degree  of 
saturation  from  near  0%  to  about  75%,  the  tests  were  conducted  on  unsaturated  specimens 
which  implies  that  both  continuous  air  and  water  phases  exist  in  the  soil  (eg.  pockets  of 


30-13 


Percent  Finer 


Compaction  Hammer  - 


Figure  3. 


occluded  air  or  water  are  not  present).  For  most  soils,  this  generally  occurs  at  saturations 
less  than  about  85%  (Corey,  1977). 

C.  Split-Hookinson  Pressure  Bar  (SHPB)  Tests 

The  SHPB  device  has  been  used  by  several  researchers  to  study  dynamic  stress 
transmission  in  soils  (see  for  example  Charlie  et  al.,  1990;  Felice  et  al.,  1987;  Ross  et.  al., 
1986;  Ross,  1989;  Veyera,  1989).  The  system  subjects  a  specimen  to  a  one-dimensional 
transient  compressive  stress  wave  generated  by  impacting  a  steel  bar  on  one  side  of  the 
specimen  with  a  projectile.  An  identical  steel  bar  on  the  other  side  of  the  specimen  captures 
the  stress  wave  transmitted  by  the  soil  specimen.  Transmitted  energy  and  wave  speed  are 
determined  by  analyzing  tnu.sient  measurements  from  strain  gages  mounted  on  the  incident 
and  transmitter  bars.  Details  of  the  SHPB  device,  principles  of  operation,  and  theory  are 
given  by  Ross  (1989).  A  schematic  of  a  soil  specimen  in  the  SHPB  device  ready  for 
testing  is  shown  in  F'  ~ure  4. 

Two  serici  of  dynamic  compressive  tests  were  performed  on  the  Ottawa  20-30 
sand  to  study  stress  transmission  characteristics  as  a  function  of  saturation  and 
microstructure.  One  series  involved  compacting  specimens  moist  and  then  immediately 
testing  them  in  the  SHPB  device.  The  other  series  involved  compacting  the  specimens 
moist,  careiuiiy  drying  them  in  the  oven  and  then  testing  them  in  the  SHPB.  The  second 
series  was  used  to  evaluate  the  importance  of  the  presence  of  water  in  the  pore  spaces 
during  testing  ind  also  to  determine  if  the  effects  produced  during  moist  compaction 
remained  in  the  microstructure  after  the  moisture  is  removed. 

D.  Epoxving  and  Sectioning  of  Compacted  Specimens 

A  commercially  available  bonding  agent  "EPOTEK  301"  was  used  to 
preserve  the  structure  of  prepared  compacted  specimens  for  microstructural  analysis. 
EPOTEK  301  is  a  high  strength,  spectrally  transparent,  low  viscosity  (100  cps),  two 
component  ejx)xy.  Curing  time  is  typically  about  one  hour  in  a  dry,  temperature  controlled 
oven  at  65±2  and  specimens  can  also  be  cured  at  room  temperature  overnight.  When 
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fully  cured,  the  epoxy  is  strongly  bonded  to  the  specimen  grains  and  maintains  the  integrity 
of  the  soil  structure  during  cutting  and  polishing  operations.  Commercially  available 
biological  stains  (Sudan  IV  and  Sudan  Black)  were  mixed  with  the  epoxy  as  a  coloring 
agent  to  provide  adequate  contrast  between  the  pores  and  grains.  Details  of  the  laboratory 
procedure  for  preparing,  epoxying  and  sectioning  compacted  sand  specimens  are  given  in  a 
previous  report  (Veyera  and  Fitzpatrick,  1990),  and  the  general  procedure  is  briefly 
outlined  as  follows: 

a)  Specimens  compacted  moist  are  dried  overnight  in  a  temperature  controlled 
oven.  Specimens  compacted  dry  are  epoxied  immediately  (step  b). 

b)  EPOTEK  301  epoxy  is  carefully  introduced  into  the  specimen  under 
atmospheric  pressure  to  saturate  the  void  spaces.  The  specimen  is  then  cured  in 
a  temperature  controlled  oven  at  65+2  for  about  1  hour. 

c)  After  curing,  the  specimen  is  extruded  from  the  stainless  steel  container. 

d)  The  sectioning  process  involves  2  major  steps  with  several  sub-steps: 

i)  Initial  Rough  Cutting  -  1)  Preliminary  cutting  of  specimen;  and  2) 
Preparation  of  vertical  and  horirontal  sections  for  grinding  and  polishing; 

ii)  Grinding  and  Polishing  -  1)  Initial  coarse  grinding;  2)  Rough  polishing; 
and  3)  Fine  finish  polishing. 

The  procedure  outlined  was  used  to  obtain  representative  sections  along  both  the 
specimen  vertical  longitudinal  axis  and  perpendicular  to  the  longitudinal  axis  at  specimen 
mid-height.  Locations  of  the  individual  cut  sections  are  shown  in  Figure  5.  After  final 
polishing,  the  specimen  sections  are  ready  for  two-dimensional  microstructural  analysis. 

E.  Microstructural  Analysis 
1.  Two-Dimensional  Imaging 

After  the  epoxied  specimens  have  been  sectioned,  a  two-dimensional  image 
of  the  grain  and  void  arrangements  was  obtained  for  microstructural  analysis  using 
photomicrographic  techniques.  A  35  mm  camera  with  special  attachments  was  mounted  on 
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Figure  5.  Location  of  Cutting  Planes  for  Epoxied  Specimens. 
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a  reflected  light  microscope  with  direct  lighting  used  to  illuminate  specimen  cut  sections 
from  above.  Each  specimen  section  was  carefully  leveled  and  centered  on  the  microscope 
platen.  A  magnification  of  lOx  was  used  on  all  specimen  sections.  Standard  35  mm  color 
slide  film  (100  ASA)  and  a  shutter  speed  1/60  second  were  used  when  photographing  the 
cut  section  surfaces,  which  provided  optimum  visual  contrast  between  grains  and  epoxy. 

Each  cut  section  was  systematically  photograph  according  to  a  prescribed  grid 
position  system  (see  Figure  6).  A  precise  record  of  frame  number  and  corresponding  grid 
position  was  maintained  for  proper  identification  of  slides  during  analysis.  For  the 
purposes  of  this  study,  only  five  grids  positions  on  the  venical  cut  section  (k,  d,  1,  c,  and 
e)  were  photographed  for  specimens  compacted  at  each  saturation  of  0,  18.7,  37.4,  56.2  , 
and  75%  (The  remaining  grid  positions  are  currently  being  analyzed.).  Two-dimensional 
particle  measurements  were  made  by  projecting  the  slides  onto  a  sheet  of  paper  and  tracing 
individual  particles.  This  data  was  then  used  to  analyze  the  microstnicture  of  the 
compacted  specimens.  Axial  ratio  and  particle  long  axis  orientation  characterization 
parameters  were  determined  following  procedures  presented  by  Oda  (1972a)  and  (Campbell 
(1985),  respectively. 

2.  Determination  of  Axial  Ratio 

A  shape  factor  for  studying  individual  grain  panicles  is  the  axial  ratio,  n, 
which  is  obtained  by  taking  measurements  from  a  two-dimensional  section  of  a  particle. 
The  axial  ratio  is  an  indication  of  how  closely  a  panicle  is  to  being  spherical  in  shape  and 
can  range  from  0  to  1.  For  example,  a  ratio  of  0.9  would  indicate  a  panicle  shape 
approaching  that  of  a  sphere  (1.0  would  be  a  sphere),  whereas  a  ratio  of  0.1  would  indicate 
an  elliptical  or  elongated  shape.  The  procedure  suggested  by  Oda  (1972a)  was  followed: 

1)  determiiie  the  length  of  the  particle's  shon  axis,  L2,  by  finding  the  diameter  of 
the  largest  possible  circle  that  can  be  inscribed  on  the  particle  (see  Figure  7a), 
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a.  Grid  Positions  for  Vertical  Specimen  Cut  Sections. 


Section  B-B  of  Compacted 
Soil  Specimen 


Grid  Positions 


b.  Grid  Positions  for  Horizontal  Specimen  Cut  Sections. 


Figure  6.  Location  of  Specimen  Grid  Positions  for  Photographic  and  Microstructural  Analysis. 
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Individual  Particle 


Individual  Particle 


a.  Short  Axis  (L2)  Determination 


b.  Long  Axis  (Li)Detcrmination. 


Figure  7.  Determination  of  Axial  Ratio  Parameters  for  an  Individual  Particle. 


2)  determine  the  length  of  the  particle's  long  axis  dimension,  Li,  by  finding  the 
diameter  of  the  smallest  possible  circle  that  can  be  circumscribed  on  the  particle  (see  Figure 
7b), 

3)  the  axial  ratio  for  an  individual  particle,  n,  is  then; 


n 


(Eq.  1) 


4)  the  axial  ratio  for  a  group  of  particles.  Uavg ,  is  then: 


n 


avg 


(Eq.  2) 


Axial  ratio  data  are  useful  in  classifying  the  general  shape  of  particles  and  arc  also 
an  indication  of  the  potential  arrangement  of  particles  that  can  be  expected  from  compaction 
(ie.,  the  closer  to  being  spherical  in  shape,  the  more  random  the  arrangement).  In  addition, 
some  physical  properties  of  sands  have  been  correlated  with  axial  ratio  results  (Oda, 
1972a).  Table  2  provides  rankings  and  visual  descriptions  of  particle  shapes  based  on  axial 
ratio  data  (roundness  ratio),  which  would  classify  Ottawa  20-30  sand  as  consisting  of  well 
rounded  particles. 

3.  Determination  of  Particle  Long  Axis  Orientation 

Particle  long-axis  orientation  is  a  standard  method  used  in  the 
microstructural  analysis  of  granular  soils.  The  long-axis  orientation  of  individual  particles 
was  evaluated  based  on  the  method  developed  by  Campbell  (1985).  The  technique 
involves  determining  both  longest  axis,  Li,  and  its  shonest  axis,  L2,  of  a  particle.  A 
rectangle  is  then  constructed  around  the  grain  and  the  orientation  angle,  0,  made  by  the 


30-23 


long-axis  direction  with  respect  to  a  set  of  Cartesian  coordinate  system  reference  axes  is 
measured  (see  Figure  8).  For  each  grid  position,  orientation  measurements  (class  intervals) 
were  made  at  10°  intervals  and  used  to  construct  frequency  histograms  of  the  data.  The 
results  then  provide  information  on  any  preferred  orientation  of  panicles  that  may  occur 
locally  in  a  specimen  or  globally  for  the  specimen  as  a  whole.  Correlations  between 
compaction  method  (pluvial,  static,  or  dynamic)  and  soil  microstructure  can  also  be  related 
to  preferred  particle  orientation  data. 

TABLE  2.  Description  of  Roundness  Classes  (Pettijohn,  1957). 


Class  Name 

Roundness  Valued 

Description 

Angular 

0-0.15 

Strongly  developed  faces  with  sharp  edges 
and  comers;  secondary  comers*^  are 
numerous. 

4 

Subangular 

0.15  -  0.25 

Strongly  developed  faces  with  somewhat 
rounded  edges  and  comers;  secondary 
comers  are  numerous. 

• 

Subrounded 

0.25  -  0.40 

The  edges  and  comers  are  rounded  and  the 
area  of  flat  faces  is  comparatively  small; 
secondary  comers  are  much  rounded  and 
reduced  in  number  (5  -  10). 

1 

Rounded 

0.40  -  0.60 

Flat  faces  are  practically  absent;  all  edges  and 
comers  are  rather  broad  curves,  and  there 
may  be  broad  re-entrant  angles;  secondary 
comers  have  disappeared. 

Well-Rounded 

0.60-  1.00 

There  are  no  flat  faces;  the  entire  surface 
consists  of  broad  curves. 

Note:  ^Roundness  value  is  equivalent  to  axial  ratio. 

*^Secondary  comers  are  the  many  minor  convexities  seen  in  the  grain  profile. 
Primary  comers  are  the  principal  interfacial  edges  and  are  fewer  in  number  (3-5). 
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+90  Reference  Axes 


-90 


Figure  8.  Determination  of  Particle  Long  Axis  (Li)  Orientation  Angle  0. 
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V.  RESULTS  and  DISCUSSION 


A.  Dynamic  Compactive  Energy 

A  record  was  kept  during  compaction  of  the  number  of  blows  required  to 
achieye  a  dry  density  of  1.715  g/cm3  (107.0  pcO  at  each  saturation  for  each  specimen. 
Figure  9  shows  the  ayerage  compactiye  energy  as  a  function  of  saturation  (based  on  results 
from  47  compacted  specimens).  The  data  has  been  normalized  to  the  ayerage  total 
compactiye  energy  required  at  0%  saturation  for  the  first  layer.  The  results  show  that  the 
required  compactiye  energy  increases  from  0%  to  about  20%  saturation,  remains  constant 
from  about  20%  to  50%  saturation,  and  then  decreases  thereafter. 

Figure  10  shows  the  compactiye  energy  data  norma,  ized  to  the  ayerage  compactiye 
energy  required  at  0%  saturation  for  each  indiyidual  layer.  The  greatest  compactiye  effort 
was  always  applied  to  the  first  layer  regardless  of  moisture  present  The  data  suggest  that 
the  required  compactiye  energy  does  not  decrease  at  higher  saturations.  It  may  be  that  at 
higher  saturations  the  pore  water  is  more  effective  in  resisting  the  compaction  energy  in  a 
single  layer,  particularly  since  water  is  unable  to  migrate  to  other  layers.  There  also  may  be 
a  partial  loosening  of  the  soil  in  this  first  layer  due  to  the  stress  wave  reflected  from  the 
pedestal  during  compaction.  The  second  and  third  layers  require  about  the  same  amount  of 
compactive  effort  and  show  a  reduction  in  required  compaction  energy  above  about  50% 
saturation.  The  fourth  and  final  layer  required  the  least  amount  of  compactive  effort  but 
still  exhibited  a  dependency  on  moisture.  Also,  layers  2,  3  and  4  show  a  trend  with 
increasing  saturation  similar  to  that  shown  in  Figure  9 . 

From  the  results  shown  in  Figures  9  and  10,  it  can  be  seen  that  there  is  a  strong 
dependency  of  compactive  energy  on  moisture  for  a  constant  dry  density  psoking.  This  can 
be  attributed  to  variations  in  overall  specimen  stiffness  with  moisture  which  may  be  due  to 
the  formation  of  preferred  particle  orientations  and  the  presence  of  capillary  pressures 
during  compaction. 


30-26 


Normalized  Average  Total  Compaction  Energy 


Tube  Wall  Thickness  =  2.54  cm  (1.0  inches) 


Figure  9.  Normalized  Average  Total  Compactive  Energy  for  Ottawa  20-30  Sand 
Compacted  to  a  Dry  Density  of  1.715  g/cc  (107.0  pcf). 
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Normalized  Average  Blows 


I 


Tube  Wall  Thickness  =  2.54  cm  (1.0  inches) 


'“'aturation  (%) 


Figure  10.  Normalized  Average  Compactive  Energy  as  a  Function  of  Saturation  by 
Layer  fc:  Ottawa  20-30  Sand  Compacted  to  Final  Dry  Density  of 
1.7 15  g/cc  (107.0  pcO. 
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B.  Split-Hopkinson  Pressure  Bar  fSHPB)  Tests 


Two  series  of  dynamic  compressive  tests  using  the  SHPB  were  performed  on 
Ottawa  20-30  sand  to  study  stress  transmission  characteristics  as  a  function  of  saturation 
and  microstructure.  One  series  involved  compacting  specimens  moist  and  then  testing  them 
immediately  to  provide  information  about  the  effect  of  moisture  during  compaction  and 
testing  on  the  dynamic  soil  response.  The  second  series  was  conducted  on  specimens 
compacted  moist,  oven-dried,  and  then  tested  in  the  SHPB.  These  tests  were  used  to 
determine  if  the  conditions  developed  during  compaction  remained  locked  in  the  soil 
structure  even  after  the  moisture  has  been  removed  from  the  pores.  Care  was  used  in 
handling  these  specimens  so  that  the  structure  formed  during  compaction  would  not  be 
disturbed  prior  to  testing. 

The  data  shown  in  Figures  1 1  and  12  represent  average  normalized  values  obtained 
from  24  specimens  compacted  moist  and  tested  moist,  and  23  specimens  compacted  moist 
and  tested  dry  from  this  study.  In  addition,  the  results  from  35  specimens  compacted  moist 
and  tested  moist  by  Ross  (1989)  are  also  shown.  The  data  in  the  figures  have  been 
normalized  to  the  average  value  at  0%  saturation  from  each  study.  Ottawa  20-30  sand 
specimens  and  identical  loading  and  boundary  conditions  were  used  in  each  investigation. 
The  only  difference  was  in  the  compacted  dry  densities  which  were  1.715  g/cc  (107.0)  in 
this  study,  and  1.750  g/cc  (109.2  pcf)  in  Ross’s  investigation. 

Figure  1 1  shows  the  variation  in  measured  dilatational  wave  speed  normalized  to 
the  value  at  0%  saturation.  The  wave  speed  is  a  measure  of  the  compressive  wave 
propagation  velocity  through  the  compacted  soil  and  is  a  function  of  material  density  and 
stiffness.  For  specimens  compacted  moist  and  tested  moist,  the  wave  speed  increases  from 
0%  to  about  20%  saturation,  remains  constant  from  about  20%  to  50%  saturation,  and  then 
decreases  thereafter.  The  data  from  Ross  (1989)  show  a  similar  trend;  however,  the  wave 
speed  magnitudes  are  lower  which  was  not  expected  since  the  dry  density  (stiffness)  was 
higher.  The  differences  may  be  due  to  the  fact  that  velocities  are  based  on  arrival  times 
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■  Compacted  Moist  and  Tested  Moist 
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Figure  11.  Normalized  Average  Wave  Speed  as  a  Function  of  Saturation 
for  Ottawa  20-30  Sand. 
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Normalized  Average  Transmission  Ratio 


■  Compacted  Moist  and  Tested  Moist 

D17  Density  =  1.715  g/cc  (107.0  pef) 

•  Compacted  Moist  and  Tested  Dry 

Dry  Density  =  1.715  g/cc  (107.0  pcO 

^  Compacted  Moist  and  Tested  Moist 
(Ross.  1989) 

Dry  Density  =  1.750  g/cc  (109.2  pcO 


Saturation  (%) 


Figure  12.  Normalized  Average  Transmission  Ratio  as  a  Function  of  Saturation 
for  Ottawa  20-30  Sand. 
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which  must  be  interpreted  from  the  recorded  strain  gage  time-history  data.  In  any  event, 
the  same  behavior  is  evident 

An  interesting  feature  in  the  normalized  wave  speed  data  is  that  magnitudes  above 
about  60%  saturation  are  lower  than  those  at  0%.  This  implies  that  the  presence  of 
moisture  during  testing  may  have  a  lubricating  effect  as  the  compressive  energy  passes 
through  the  specimen.  Particles  would  then  be  able  to  move  somewhat  more  freely  relative 
to  each  other  and  absorb  more  energy. 

Figure  1 1  also  shows  the  normalized  wave  speed  data  for  specimens  compacted 
moist  and  tested  dry.  The  saturations  associated  with  this  data  represent  the  moisture 
conditions  during  compaction.  The  data  suggests  a  trend  similar  to  that  previously 
described;  however  the  wave  speeds  are  generally  larger  particularly  at  about  20% 
saturation  and  decrease  with  increasing  saturation.  This  indicates  larger  specimen 
stiffnesses  compared  with  those  tested  moist  at  the  same  saturations.  In  addition,  no 
plateau  region  exists  and  the  wave  speed  remains  above  that  at  0%  saturation  regardless  of 
the  moisture  condition  during  compaction.  Therefore,  it  appears  that  soil  microstructural 
characteristics  developed  during  compaction  that  influence  stress  wave  propagation  velocity 
remain  intact  even  after  the  moisture  in  the  pores  has  been  removed. 

Figure  12  shows  the  variation  in  stress  transmission  ratio  normalized  to  the  value  at 
0%  saturation.  The  transmission  ratio  is  a  measure  of  the  impact  energy  transmitted  by  the 
soil  specimen  after  loading.  For  specimens  compacted  moist  and  tested  moist,  the 
transmission  ratio  increases  from  0%  to  about  30%  saturation,  remains  constant  from  about 
30%  to  50%  saturation,  and  then  decreases  thereafter.  The  data  from  Ross  (1989)  show  a 
similar  trend;  however,  the  transmission  ratio  magnitudes  are  higher  which  was  expected 
due  to  the  greater  dry  density  (stiffness).  Since  these  results  are  based  on  peak  values  from 
the  time-history  records,  no  interpretation  of  the  data  was  necessary  and  the  results 
therefore,  appear  to  be  more  consistent  with  anticipated  trends.  The  results  in  Figure  12 
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further  show  that  the  transmitted  energy  at  higher  saturations  (above  about  65%)  is  less 
than  at  0%  saturation  which  may  be  due  to  a  lubricating  effect  as  previously  noted. 

Figure  12  also  shows  the  normalized  transmission  ratio  data  for  specimens 
compacted  moist  and  tested  dry.  The  data  suggests  a  trend  similar  to  that  previously 
described;  however  the  transmission  ratios  are  larger  and  only  gradually  decrease  with 
increasing  saturadon.  As  with  the  wave  speed  data,  this  also  indicates  larger  specimen 
stiffnesses  compared  with  those  tested  moist  at  the  same  saturations.  There  is  no  plateau 
region  and  the  transmission  ratio  remains  above  that  at  0%  saturation  regardless  of  the 
moisture  condition  during  packing.  These  results  also  support  the  idea  that  the  soil 
microstructural  characteristics  developed  during  compaction  which  influence  stress 
transmission,  remain  intact  even  after  drying. 

C.  Microstructural  Analysis 

Figure  13  shows  the  results  of  the  axial  ratio  particle  analysis  and  is  a 
composite  plot  of  over  2500  individual  particles  which  were  manually  analyzed.  The 
results  indicate  that  for  these  particles  the  average  axial  ratio  is  0.674  (standard  deviation  of 
0.106)  which  means  that  the  particles  are  approaching  spheres  in  shape.  Based  on  the  axial 
ratio  data  obtained,  the  Ottawa  20-30  sand  would  be  classified  as  consisting  of  well 
rounded  particles  according  to  the  criteria  shown  in  Table  2. 

Figures  14  through  28  show  the  results  of  the  particle  orientation  analyses  for  5 
different  grid  positions  k,  d,  1,  c  and  e  (see  Figure  6)  at  saturations  of  0,  18.7,  37.4,  56.2 
and  75%.  These  grid  positions  are  located  in  a  the  vertical  plane  (section  A-A,  Figure  5)  at 
the  center  of  the  specimen  away  from  the  boundaries.  Each  grid  position  contains 
approximately  1(X)  individual  Ottawa  20-30  sand  particles.  The  entire  grid  is  currently 
being  analyzed  for  both  the  horizontal  and  venical  cut  sections  and  the  results  will  be 
forthcoming  in  a  future  report. 

The  composite  results  for  grid  positions  c,  d,  e  which  extend  horizontally  across  the 
cut  section,  are  shown  in  Figures  14-18,  and  those  for  grid  positions  k,  d,  1  which  extend 
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Axial  Ratio 

Figure  13.  Axial  Ratio  Histogram  for  Ottawa  20-30  Sand  Particles. 


vertically  across  the  cut  section  are  shown  in  Figures  19-23.  The  data  shown  in  these 
figures  do  not  suggest  any  particular  preferred  orientation  along  the  directions  examined  for 
either  grid  position  grouping. 

Considering  the  extent  of  the  data  analyzed  to  date,  it  is  probably  more  useful  to 
view  the  results  as  a  whole  and  look  at  a  composite  representation  of  all  grid  positions  at 
each  saturation.  These  results  are  shown  in  Figures  24-28  and  Table  3  is  a  summary  of  the 
maximum  frequency  of  particle  orientations  based  on  the  three  peak  values  obtained  from 
each  figure.  The  general  trend  of  the  data  shows  that,  for  the  most  part,  some  preferred 
grain  orientations  occur  between  about  -40°  and  +20°.  However,  there  does  not  appear  to 
be  a  strongly  preferred  orientation  at  any  saturation.  This  may  be  due  to  the  relatively  small 
number  of  particles  examined  so  far,  and  these  results  may  change  when  the  remaining  data 
are  analyzed.  Also,  the  axial  ratio  determined  for  Ottawa  20-30  sand  was  0.674  which 
means  that  the  particles  are  approaching  a  spherical  shape,  and  therefore,  may  not  exhibit 
any  signifleant  degree  of  preferred  orientation.  This  would  be  consistent  with  the  findings 
of  Oda  (1972a)  who  observed  that  for  axial  ratios  at  or  above  0.70,  particles  tend  to  pack  at 
random  orientations.  If  this  turns  out  to  be  the  case  for  Ottawa  20-30  sand  once  the 
remaining  data  has  been  analyzed,  then  the  influence  of  other  mechanisms,  such  as  locked 
in  stresses  and  stress  ani.sotn)py  will  need  to  be  considered.  At  this  point,  it  is  difficult  to 
develop  any  correlations  between  the  microstructural  analysis  data  and  the  results  of  the 
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Figure  14.  Particle  Orientation  Histogram  for  Ottawa  20-30  Sand  at  S  =  0% 
Compacted  to  a  Dry  Density  of  1.715  g/cc  (107.0  pcf). 
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Tube  Wall  Thickness  =  2.54  cm;  Grid  Positions  =  k,  d,  1 


Orientation  Angle,  0  (degrees) 


Figure  16.  Particle  Orientation  Histogram  for  Ottawa  20-30  Sand  at  S  =  37.4% 
Compacted  to  a  Dry  Density  of  1.715  g/cc  (107.0  pcf). 


30-38 


Frequency 


Tube  Wall  Thickness  =  2.54  cm;  Grid  Positions  =  k,  d,  1 


30 


-90  -80  -70  -60  -50  -40  -30  -20  -10  10  20  30  40  50  60  70  80  90 


Orientation  Angle,  0  (degrees) 


Figure  17.  Particle  Orientation  Histogram  for  Ottawa  20-30  Sand  at  S  =  56.2% 
Compacted  to  a  Dry  Density  of  1.715  g/cc  (107.0  pcO- 
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Figure  18.  Particle  Orientation  Histogram  for  Ottawa  20-30  Sand  at  S  =  75.0% 
Compacted  to  a  dry  Density  of  1.715  g/cc  (107.0  pcf). 
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Figure  19.  Particle  Orientation  Histogram  for  Ottawa  20-30  Sand  at  S  =  0% 
Compacted  to  a  Dry  Density  of  1.715  g/cc  (107.0  pcO- 
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Figure  20.  Particle  Orientation  Histogram  for  Ottawa  20-30  Sand  at  S  =  18.7% 
Compacted  to  a  Dry  Density  of  1.715  g/cc  (107.0  pcO. 
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Figure  21.  Particle  Orientation  Histogram  for  Ottawa  20-30  Sand  at  S  =  37.4% 
Compacted  to  a  Dry  Density  of  1.715  g/cc  (107.0  pcf). 
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Figure  22.  Particle  Orientation  Histogram  for  Ottawa  20-30  Sand  at  S  =  56.2% 
Compacted  to  a  Dry  Density  of  1.715  g/cc  (107.0  pcO- 
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Figure  24.  Particle  Orientation  Histogram  for  Ottawa  20-30  Sand  at  S  =  0% 
Compacted  to  a  Dry  Density  of  1.715  g/cc  (107.0  pcf). 


30-46 


Frequency 


Tube  Wall  Thicloiess  =  2.54  cm;  Grid  Positions  =  k,  d,  1,  c,  e 


Orientation  Angle,  0  (degrees) 


Figure  25.  Particle  Orientation  Histogram  for  Ottawa  20-30  Sand  at  S  =  18.7% 
Compacted  to  a  Dry  Density  of  1.715  g/cc  (107.0  pcO- 
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Figure  26.  Particle  Orientation  Histogram  for  Ottawa  20-30  Sand  at  S  =  37.4% 
Compacted  to  a  Dry  Density  of  1.715  g/cc  (107.0  pcO- 
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Figure  27,  Particle  Orientation  Histogram  for  Ottawa  20-30  Sand  at  S  =  56.2% 
Compacted  to  a  Dry  Density  of  1.715  g/cc  (107.0  pcO. 
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Figure  28.  Particle  Orientation  Histogram  for  Ottawa  20-30  Sand  at  S  =  75.0% 
Compac  d  to  a  Dry  Density  of  1.715  g/cc  (107.0  pcO- 
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TABLE  3.  Summary  of  Particle  Orientation  Analysis  for  Particle 
Orientation  Angle,  6  (degrees),  as  a  Function  of  Saturation. 


Saturation  (%) 

k,  d,  1  (a) 

c,  d,  e  Ih) 

k,  d,  1,  c,  e  (c) 

0 

-30.  -20,  +20,  +90 

-50  to  -20 

-40,  -30,  -20 

18.7 

-50  to -10.  +20 

-40,-20,  +20 

-40,  +10,  +20 

37.4 

-40,  -10,  +10 

-30,  -10,  +30 

-10,  +10,  +30 

56.2 

-20,  -10,  +10 

-90,  -20,  +10 

-90,  -20,  +10 

75.0 

-20,  +10,  +40 

-30,  +10,  +40 

-20  to  +10,  +40 

Note;  ^Refer  to  Figures  14-18  (see  Figure  6  also). 

^Refer  to  Figures  19-23  (see  Figure  6  also). 

‘^Refer  to  Figures  24-28  (see  Figure  6  also). 

Results  shown  are  based  on  the  three  largest  peaks  observed  in  the  data. 
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VI.  CONCLUSIONS 


Based  on  the  results  of  this  investigation,  several  conclusions  can  be  drawn  with 
regards  to  the  behavior  of  compacted  unsaturated  Ottawa  20-30  sand: 

1)  The  compactive  energy  required  to  produce  a  constant  dry  density  specimen  by 
dynamic  compaction  is  strongly  dependent  on  the  amount  of  moisture  )sent. 
The  largest  amount  of  effort  is  needed  to  compact  specimens  at  about  20-50% 
saturadon,  while  the  least  amount  is  for  dry  packing.  This  can  be  attributed  to 
variadons  in  overall  specimen  sdffness  with  moisture  which  may  be  due  to  the 
formation  of  preferred  particle  orientations  and  the  presence  of  capillary 
pressures  during  compaction. 

2)  For  SHPB  specimens  compacted  moist  and  tested  moist,  the  transmission  ratio 
and  wave  speed  increase  from  0%  to  about  25%  saturation,  remain  constant  to 
about  60%  saturation,  and  then  decrease  thereafter.  An  interesting  feature  in  the 
data  is  that  magnitudes  above  about  60%  saturation  are  lower  than  those  at  0% 
saturation.  This  implies  that  the  presence  of  moisture  during  testing  may  have  a 
lubricating  effect  as  the  compressive  energy  passes  through  the  specimen. 
Particles  would  then  be  able  to  move  somewhat  more  freely  relative  to  each 
other  and  absorb  more  energy. 

3)  For  SHPB  specimens  compacted  moist  and  tested  dry,  the  transmission  ratio 
and  wave  speed  are  generally  larger  than  for  specimens  compacted  moist  and 
tested  moist,  particularly  at  about  20%  saturation,  after  which  they  decrease 
with  increasing  saturation.  In  addition,  no  plateau  region  exists  and  the  wave 
speed  remains  above  that  at  0%  saturation  regardless  of  the  moisture  condition 
during  compaction.  Therefore,  it  appears  that  soil  microstructural 
characteristics  developed  during  compaction  which  influence  the  transmission 
ratio  and  wave  speed,  remain  intact  even  after  the  moisture  in  the  pores  has  been 
removed. 
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4)  The  SHPB  data  are  in  general  agreement  with  the  findings  of  Ross  (1989).  In 
addition,  the  results  are  very  similar  to  those  of  Hughes  and  Kelly  (1952)  who 
directly  measured  variations  in  dilatational  velocity  with  saturation  and 
confining  pressure  on  rock  core  specimens.  This  indicates  that  locked  in 
stresses  from  compaction  in  the  SHPB  tests,  and  from  confining  pressure  in  the 
rock  core  tests,  may  be  a  significant  factor  affecting  the  dynamic  response  in 
addition  to  the  moisture  condition. 

5)  For  the  microstructural  data  analyzed  to  date,  there  is  some  weak  dependance  of 
particle  orientation  on  moisture  conditions  during  packing  and  no  particularly 
strong  dependence  was  evident  at  any  saturation.  However,  this  may  be  due  to 
the  relatively  small  number  of  particles  examined  and  the  results  may  change 
when  the  remaining  data  is  analyzed.  In  addition,  there  may  also  be  some 
important  three-dimensional  effects  which  are  not  evident  in  the  two- 
dimensional  analysis  performed  in  this  study. 

VII.  RECOMMENDATIONS 

The  research  reported  herein  has  provided  some  valuable  insight  into  the  behavior 
of  unsaturated  cohesionless  soils.  Based  on  this  work,  several  recommendations  can  be 
made  for  further  studies: 

1)  An  investigation  should  be  made  to  study  the  behavior  of  Ottawa  20-30  and 
other  sands  (such  as  Eglin  and  Tyndall  sands)  at  different  dry  unit  weights  (ie., 
variations  in  compactive  energy)  to  develop  correlations  among  dry  density, 
compactive  energy,  stress  transmission,  saturation  and  microstructural 
parameters.  This  would  also  provide  data  on  the  effects  of  particle  size,  grain 
size  distribution  and  panicle  shape. 

2)  Tests  should  be  conducted  to  assess  the  influence  of  boundary  conditions  on  the 
static  and  dynamic  properties  of  unsaturated  sand.  This  would  require  the 
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fabrication  of  an  appropriate  cell-type  device  capable  of  applying  controlled 
confining  pressures  to  a  compacted  specimen  of  unsaturated  sand  that  could  be 
used  in  both  the  SHPB  and  a  pseudo-static  loading  system  such  as  the  MTS. 
Some  general  information  about  such  behavior  could  also  be  obtained  by 
performing  tests  on  unsaturated  specimens  compacted  in  containers  having 
different  wall  thicknesses  to  simulate  differing  degrees  of  confinement. 

3)  Some  pulse  dilatational  wave  velocity  measurements  should  be  made  on 
compacted  unsaturated  sands  at  various  confining  pressures  in  a  modified 
triaxial  testing  apparatus.  This  will  provide  useful  non-destructive  data  about 
the  influence  of  pressure  on  dynamic  response  that  could  be  correlated  with 
high  intensity  transient  SHPB  data  and  shear  strength  data.  In  addition,  the 
triaxial  device  could  also  be  used  to  apply  anisotropic  confinement  which  would 
more  closely  simulate  in  situ  conditions. 

4)  Microstructural  studies  should  be  continued  and  expanded  to  investigate  the 
influence  of  variations  in  compactive  energy,  saturation,  boundary  conditions 
and  material  type  on  the  static  and  dynamic  behavior  of  soils.  In  addition, 
studies  should  be  initiated  to  examine  three-dimensional  spatial  orientations  in 
the  compacted  grain  matrix  structure.  This  may  yield  useful  results  since  some 
feamres  are  inevitably  lost  when  viewing  two-dimensional  sections. 
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The  reaction  of  anhydrous  lithium  nitrate  and  trif luoromethanesulfonic 
anhydride  (triflic  anhydride)  in  refluxing  nitromethane  solvent  was 
found  to  be  the  most  convenient  method  for  preparing  nitronium 
trif luoromethylsulf onatP  (.nitronium  triflate).  Nitronium  triflate  in 
anhydrous  nitromet...-  was  found  to  be  a  convenient  nitrating  agent 
for  a  limited  r jr  or  of  organic  compounds.  Bromobenzene  was  converted 
in  75%  yield  into  ortho-  and  para-bromonitrobenzene.  The  objective  of 
the  research  was  to  N-nitrate  nitrogen  heterocycles.  Nitration  was 
successful  only  with  2-pyrrolidone.  N-Nitro-2-pyrrolidone  was  obtained 
in  46%  yield. 
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I.  INTRODUCTION:  It  has  been  recognized  for  many  years  that  many 

conventional  World  War  II  explosives  are  too  sensitive  for  use  in  the 
high  technology  environment  of  modern  ships  and  aircraft.  New  insensi¬ 
tive  high  explosives  are  needed  for  use  in  the  costly  delivery  systems 
used  in  modern  warfare.  Classical  nitration  methods  are  too  vigorous 
to  be  used  for  nitration  of  many  potentially  interesting  heterocyclic 
ring  systems.  The  need  for  new  methods  of  nitration  was  recognized  in 
an  authoritative  review  by  the  Explosives  Technology  Group  at  the  Los 
Alamos  National  Laboratory  [ 1 ] .  A  review  of  newer  nitration  techniques 
was  presented  by  th  s  investigator  in  an  earlier  report  written  for  the 
1989-1990  USASF-UES  Summer  Faculty  Research  Program  [2].  In  reference 
2  several  other  references  are  presented  which  describe  the  work  upon 
which  the  earlier  research  and  the  present  research  was  based.  For  the 
convenience  of  the  reader  the  references  cited  in  reference  2  are  cited 
again  in  this  report  [3-12]. 

In  earlier  scouting  work  an  unreliable  and  erratic  method  was  developed 
for  iji_  situ  synthesis  of  nitronium  triflate  (nitronium  trif luoromethane- 
sulfonate)  from  lithium  nitrate  and  triflic  anhydride  (trifluoromethane- 
sulfonic  acid),  Nitronium  triflate  was  found  to  C-nitrate  bromobenzene 
and  to  N-nitrate  2-pyrrolidone  but  these  reactions  were  never  developed 
to  the  point  of  reliability  and  reproducibility.  The  purpose  of  the 
present  work  was  to  develop  the  earlier  work  and  apply  it  to  nitration 
of  nitrogen  heterocyclic  compounds.  The  major  goals  of  the  research 
conducted  for  this  report  were: 

1.  To  achieve  one-flask  nitrations  of  heterocyclic  compounds  using 
nitronium  triflate  formed  in  situ  in  a  low-boiling  solvent  of 
low-polarity  such  as  dichloromethane.  If  this  was  not  possible 
then : 

2.  To  achieve  one-flask  nitration  of  heterocyclic  compounds  using 
nitronium  triflate  formed  in  situ  in  a  polar  solvent  such  as 
nitromethane. 

3.  To  form  nitronium  triflate  from  metal  nitrate  salts  other  than 
lithium  or  ammonium  nitrates  (calcium  nitrate  in  particular). 


34^ 


II.  SYNTHESIS  OF  NITRONIUM  TRIFLUOROMETHANESULFONATE:  Initially 
the  synthesis  of  nitronium  trifluoromethanesulfonate,  trivially  named 
nitronium  triflate,  was  carried  out  in  dichlormomethane  solvent  by 
reacting  dry  pulverized  lithium  nitrate  with  trif luoromethanesulf onic 
anhydride  (triflic  anhydride).  It  was  observed  that  nitronium  trif¬ 
late  formed  in  refluxing  dichloromethane  over  an  extended  period  of 
twelve  hours  to  five  days.  The  rates  of  reaction  and  yields  of 
product  were  erratic. 

LiN03  +  CF3S(02)-0-S(02)  - >  [NO2+]  [-OS(02)CF3]  +  CF3SO3H 

lithium  triflic  nitronium  triflic 
nitrate  anhydride  triflate  acid 

When  nitronium  triflate  was  successfully  synthesized,  it  could  be 
determined  by  visual  observation.  Solid  nitronium  triflate  sublimed 
onto  a  cold  finger  above  dichloromethane  or  precipitated  on  the  side 
of  the  flask  at  the  air  dichloromethane  interface.  In  the  earlier 
work  [2]  nitronium  triflate  was  not  isolated  but  was  reacted  in  the 
initial  reaction  flask  by  adding  a  dichloromethane  solution  of  2-pyr- 
rclidone  or  bromobenzene  to  the  initital  flask.  In  two  experiments 
2-pyrrolidone  was  nitrated  to  form  N-nitropyrrolidone  in  30-35%  yield, 
but  in  most  experiments  other  products  dominated.  The  synthesis  of 
nitronium  triflate  was  further  demonstrated  by  nitration  of  bromo¬ 
benzene  to  form  a  mixture  of  ortho-  and  para-bromonitrobenzene. 


The  unpredictable  and  erratic  formation  of  nitronium  triflate  in  di- 
chlormethane  forced  abandonment  of  this  preferred  solvent  in  favor  of 
higher  boiling  nitromethane  and  its  very  high  dielectric  constant  and 
accompanying  problems  with  separations  in  the  workup.  A  consistent 
and  reliable  formation  of  nitronium  triflate  was  achieved  by  reacting 
lithium  nitrate  and  triflic  anhydride  for  one  hour  in  nitro  methane. 
T.ie  synthesis  of  nitronium  triflate  was  confirmed  adding  bromobenzene 
and  isolating  the  nitration  products. 
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Experimental  Procedure  for  Reaction  of  Lithium  Xitrate  with  Triflic 


Anyhdride.  Lithium  nitrate  was  pulverized  with  a  mortar  and  pestle 
dried  in  an  oven  at  185-190^  for  a  minimum  of  one  day.  Dried  lithium  , 

nitrate  and  flamed-dried  glassware  were  stored  in  an  oven  at  greater 
than  100°.  When  glassware  was  assembled,  a  stream  of  dry  argon  was 
used  to  maintain  dryness.  A  100-ml,  3-necked  flask  was  fitted  with  a 
compensating  dropping  funnel,  reflux  condenser,  gas  inlet  tube  and  a  « 

magnetic  stirring  bar.  Before  assembling  the  apparatus  a  stoichio¬ 
metric  amount  (e.g.,  1.13  g,  16.3  mmol)  of  dried  lithium  nitrate  was 

added  to  the  hot  dry  reaction  flask.  After  assembly  of  the  apparatus 
one  stoichiometric  amount  of  triflic  anhydride  (e.g.,  2.2  ml,  13.1  , 

mi.^l)  in  10  ml  of  dry  nitromethane  was  added.  The  anhydrous  mixture 
was  heated  one  hour  under  reflux  and  cooled  over  a  two-hour  period 
while  under  an  argon  atmosphere.  At  the  beginning  of  the  reaction  all 
white  solid  (lithium  nitrate,  sp.  gr.  2.35)  was  at  the  bottom  of  the 
flask  under  the  nitromethane  solvent  (sp.  gr.  1.68).  After  reaction 
white  crystals  floated  on  the  nitromethane  surface  and  adhered  to  the 
flask  walls  at  the  air/solvent  interface.  A  small  amount  of  white 
solid  (LiN03)  remained  at  the  bottom  of  the  flask. 

III.  NITRATION  OF  BROMOBENZENE  AS  AN  ASSAY  FOR  NITRONIUM  TRIFLATE: 

The  last  reaction  carried  out  during  the  1989  USASF-UES  Summer  Faculty 
Research  Program  was  nitration  of  bromobenzene  with  nitronium  triflate 
formed  in  situ  from  lithium  nitrate  and  triflic  anhydride  in  nitro¬ 
methane.  A  mixture  of  ortho-  (32%)  and  para-bromonitrobenzene  (68%) 
was  separated  from  excess  unreacted  bromobenzene  and  a  small  amount  of 
bromo-2 , 4-dinitrobenzene .  In  this  study  we  have  used  the  nitration  of 
bromobenzene  as  a  convenient  assay  for  the  formation  of  nifionium 
triflate  because  the  volatile  bromobenzene  is  easily  removed  at 
reduced  pressure  from  the  much  less  volatile  bromonitrobenzenes  which 
can  be  conveniently  identified.  The  weight  of  nonvolatile 
bromonitrobenzenes  can  be  quickly  equated  to  the  amount  of  nitronium 
triflate  formed.  The  yield  of  bromonitrobenzenes  were  consistently 
65-77%  based  on  triflic  anhydride. 


34-6 


The  chemistry  described  on  the  preceding  pages  is  summarized  by  the 
following  two  equations: 


LiN03  +  CF3S(02)-0-S(02)  - ^  [NO2+]  [-0S(02)CF3]  +  CF3SO3H 

lithium  triflic  nitronium  triflic 

nitrate  anhydride  triflate  acid 


[NO2+]  [-0S(02)CF3]  +  C6H5Br  - o-  and  p-Br(C6H4)N02  +  CF3SO3 

nitronium  triflate  bromo-  bromonitrobenzene  triflic 

benzene  ortho(32%),  para(68%)  acid 

Table  I  which  follows  presents  a  representative  summary  of  nitration 
reactions  of  bromobenzene  which  were  used  as  an  assay  for  formation  of 
nitronium  triflate  from  lithium  nitrate  and  triflic  anhydride. 


TABLE  I.  NITRATION  OF  BROMOBENZENE  AS  AN  ASSAY  FOR  NITRONIUM  TRIFLATE 
R  Lithium  Triflic  Solvent  Time  of  Bromobenzene  Bromonitrobenzene 


u  Nitrate 

Anhydride 

NM* 

Reflux 

ortho  +  para 

n  g  (mmol) 

g  (mmol)  ' 

or  pOi 

hr/days 

g  (mmol) 

g  (mmol)  %  yield 

1.  0.689 

(10.0) 

2.82  (10.0) 

DCM 

6  hr 

0.157 

(10.0) 

<0.1  g,  nil 

2.  0.689 

(10.0) 

2.82  (10.0) 

DCM 

18  hr 

0.157 

(10.0) 

0.2  (2.0)  10% 

3.  0.689 

(10.0) 

2.82  (10.0) 

DCM 

3  days 

0.157 

(10.0) 

0.5  (2.50)  25% 

4.  0.689 

(10.0) 

2.82  (10.0) 

DCM 

7  days 

0.157 

(10.0) 

0.6  (3.0)  30% 

5.  0.689 

(10.0) 

2.82  (10.0) 

NM 

6  hr 

0.157 

(10.0) 

1.5  (7.5)  75% 

6.  0.689 
(10.0) 

2.82  (10.0) 

NM 

1  hr 

0.157 

(10.0) 

1.5  (7.5)  75% 

7.  1.38 
(20.0) 

5.64  (20.0) 

NM 

1  hr 

0.314 

(20.0) 

3.19  (15.8)  79% 

*  NM  is  nitromethane ;  DCM  is  dichloromethane. 


A  literature  survey  was  made  of  the  solubilities  of  alkali  and 
alkaline  earth  nitrates  in  organic  solvents  in  the  hope  of  finding  a 
nitrate  more  suitable  for  synthesis  of  nitronium  triflate  than  lithium 
nitrate  or  ammonium  nitrate.  Table  II  presents  some  of  the  solubility 
data  which  was  found  [8].  Table  III  is  discussed  on  the  next  page. 

TABLE  II.  SOLUBILITIES  OF  GROUP  I  AND  II  SALTS  IN  ANHYDROUS  ‘'ETHANOL 
Grams  salt/lOO  grams  anhydrous  methanol 


Fluoride 

Chloride 

Bromide 

Iodide 

Nitrate 

Carbonate 

Sulfate 

(1.36) 

(1.81) 

(1.95) 

(2.16) 

Lithium 

0.0176 

20.98 

34.29 

42.95 

0.0555 

0.1261 

ro.60) 

Sodium 

0.0231 

1.401 

16.09 

62.51 

2.936 

0.3109 

0.0113 

(0.95) 

Potassium 

2.286 

0.5335 

2.080 

17.07 

0.380 

6.165 

0.0005 

(1.33) 

Calcium 

0.0145 

23.26 

55.83 

67.37 

127.13 

0.0012 

0.0046 

(0.99) 

Strontium 

0.0142 

18.05 

1.061 

0.0014 

0.0074 

(1.13) 

Barium 

0.0044 

1.379 

0.048 

0.0064 

0.0063 

(1.35) 

(Ionic  radii  in  Angstroms  are  given  in  parentheses.) 


TABLE  III.  BROMOBENZENE  ASSAY  FOR  NITRONIUM  TRIFLATE 
FROM  CALCIUM  NITRATE  AND  TRIFLIC  ANHYDRIDE 

R  Calcium  Triflic  Solvent  Time  of  Bromobenzene  Bromonitrobenzene 


u  Nitrate 
n  ^  (mmol) 

Anhydride 
g  ( mmo 1 ) 

NM* 

or  DCM 

Reflux 

hr/days 

g  (mmol) 

ortho  +  para 
g  (mmol)  %  yield 

1 .  1.64 

10.0) 

2.82 

(10.0) 

DCM 

5  hr 

0.157 

(10.0) 

0.0  (0.0)  0%  No 
evidence  of  rxn. 

2 .  1.64 

(10.0) 

2.82 

(10.0) 

DCM 

3  days 

0.157 

(10.0) 

0.0  (0.0)  0%  No 
evidence  of  rxn. 

3.  1.64 

(10.0) 

2.82 

(10.0) 

NM 

12  hr 

0.157 

(10.0) 

0.0  (0.0)  0%  No 
evidence  of  rxn. 

4.  1.64 

(10.0) 

2.82 

(10.0) 

NM 

3  days 

0.157 

(10.0) 

0.0  (0.0)  0%  No 
evidence  of  rxn. 

*  NM  is  nitromethane ;  DCM  is  dichloromethane. 
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As  seen  in  Table  II,  calcium  nitrate  is  very  soluble  in  methanol.  For 
this  reason  we  expected  it  to  be  soluble  in  nitromethane  and  react 
readily  ith  triflic  anhydride  to  form  nitronium  triflate.  Calcium 
nitrate  did  not  react  with  triflic  anhydride  in  either  dichloromethane 
or  nitromethane.  The  results  of  four  attempted  reactions  are 
summarized  in  Table  III  presented  on  the  previous  page. 

IV.  ONE-FLASK  NITRONIUM  TRIFLATE  NITRATION  OF  2-PYRROLIDONE  TO 

FORM  N-NITR0-2-PYRR0LID0NE;  During  scouting  research  in  Summer  1989, 
2-pyrrolidone  was  successfully  nitrated  by  nitronium  triflate  to  give 
N-nitro-2-pyrrolidone  in  27-33%.  The  nitrations  were  not  consistent 
and  in  many  attempted  one-flask  reactions,  no  N-nitro-2-pyrrolidone 
was  obtained.  The  inconsistency  of  results  was  believed  to  be  due  to 
inconsistent  formation  of  the  nitrating  agent,  nitronium  triflate. 

In  the  work  for  this  report,  N-nitro-2-pyrrolidone  was  obtained  in 
consistent  35-46%  yields  where  nitronium  triflate  was  prepared  in 
nitromethane  as  described  in  Section  II  of  this  report.  Major  Shack¬ 
elford  [13]  suggested  that  the  maximum  yield  of  N-nitro-2-pyrrolidone 
may  only  be  50%  because  of  concomitant  0-nitration.  In  Chart  I, 
structures  are  shown  for  the  various  products  that  have  been  formed  in 
the  nitronium  triflate  nitration  of  2-pyrrolidone. 

Table  IV  summarizes  many  of  the  successful  and  unsuccessful  attempts 
to  prepare  N-nitro-2-pyrrolidone.  The  highest  yield  of  crude  N-nitro- 
2-pyrrolidone  based  on  triflic  anhydride  was  55%  when  a  100%  excess  of 
2-pyrrolidone  was  used.  Analysis  of  crude  product  by  nmr  spectroscopy 
resulted  in  an  estimate  of  90%  purity  which  gives  a  yield  of  49%  which 
is  less  than  the  50%  figure  suggested  by  Major  Shackelford  [13]. 

Experimental  Procedure  for  Synthesis  of  N-Nitro-2-pyrrolidone.  Flame- 
dried  glassware  was  stored  in  an  oven,  assembled  while  hot,  and  main¬ 
tained  under  a  dry  argon  atomosphere.  Oven-dried  (190°)  lithium 
nitrate  (2.03  g,  0.0294  mol)  and  a  1-cm  magnetic  stirring  bar  were 
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Products  from  the  Nitronium  Triflate  Nitration  of  2-Pyrrolidone 


0 


HOCH2CH2CH2COOH 


H2NCH2CH2CH2COOH 


III 


IV 


N~N=0 


TABLE  IV.  NITRATION  OF  2-PYRROLIDONE  WITH  NITRONIUM  TRIFLATE 
TO  FORM  N-NITRO-2-PYRROLIDONE  (NN2P) 


R 

u 

n_ 

1. 


2. 


3. 


4. 


5. 


6. 


7. 


8. 


9. 


Lithium 

Triflic 

Solvent 

Condi¬ 

2-Pyrrol¬ 

Na2S04 

Product  Data 

Nitrate 

Anhydride 

DCM 

tions 

idone 

g 

refer  to  Chart  I 

g  ( mmo 1 ) 

g  (mmol) 

or  m 

t.T 

g  (mmol) 

( mmo 1 ) 

misc  information 

2.85 

5.06 

DCM 

24  hr 

1.70 

5.70 

84%  I,  16%  II 

(41.3) 

(20.2) 

50  ml 

reflux 

(20.0) 

(40.4) 

no  NN2P 
see  CHART  I 

10.0 

(145) 

5.72 

(20.3) 

none 

24  hr 
reflux 

1.73 

(20.3) 

none 

18%  2P,  52%  I 
no  NN2P 

3.0 

(43.5) 

4.98 

(17.7) 

none 

14  hr 
reflux 

1.72 

(20.2) 

none 

0.134  g,  5%  NN2P 
95%  pure  NN2P 

none 

5.77 

(20.5) 

DCM 

50  ml 

2P 

added 

1.70 

(20.0) 

6.0 

(42) 

I-V  from  Chart  I 

2.75 

5.64 

none 

14  hr 

1.70 

5.68 

0.80  g  (6.0  mmol) 

(40.0) 

(20.0) 

reflux 

(20.0) 

(40) 

31%  NN2P,  trace  VI 

2.75 

5.64 

NM 

4  hr 

1.70 

none 

0.20  g  (1.6  mmol ) 

(40.0) 

(20.0) 

1.02  g 

room 

(20.0) 

7%  NN2P,  trace  VI 

1.00 

3.69 

NM 

6  hr 

0.57 

1.89 

0.13  g  NN2P  by 

(14.6) 

(13.1) 

10  ml 

reflux 

(6.7) 

(13.3) 

ir,  nmr 

1.90 

2.60 

NM 

2  hr 

2.69 

4.40 

1.14  g  crude  NN2P 

(27.5) 

(15.6) 

15  ml 

950C 

(31.0) 

(31.0) 

0.91  g  pure,  45% 

2.03 

4.20 

NM 

1  hr 

2.58 

4.23 

0.93  g  NN2P 

(29.4) 

(14.9) 

10  ml 

950c 

(29.8) 

(29.8) 

mp  =  53-550C 

Notes  on  Columns  of  Table  IV 


Run : 

Lithium  Nitrate: 
Triflic  Anhydride; 
Solvent : 


Arbitrary  #;  do  not  include  all  experiments. 
Anhydrous  from  Baker's  LiN03  trihydrate. 
Aldrich  best  grade, 

DCM  =  dichloromethane ;  NM  =  nitromethane. 


Na2S04 
Chart  I 
2P  and  NN2P 


Used  as  base  to  remove  triflic  acid. 

Stuctures  of  products  given  on  page  10. 
2-Pyrrolidone  and  N-Nitro-2-Pyrrolidone. 
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added  to  a  three-necked  100-ml  round-bottomed  flask  which  was  equipped 
with  a  cold  finger,  condenser  and  inlet/outlet  tubes  for  maintaining 
an  argon  atmosphere.  Nitromethane  (10  ml)  was  added  to  a  dry  compensa¬ 
ting  dropping  funnel  which  was  placed  at  the  top  of  the  vertical  con¬ 
denser.  A  dry  syringe  was  used  to  add  2.60  ml  of  triflic  anhydride 
(Aldrich  catalog  15,853-4,  d  =  1.696,  4.40  g,  15.6  mmol)  to  the  nitro¬ 
methane  in  the  compensating  dropping  funnel.  The  nitromethane/trif lie 
anhydride  solution  was  added  rapidly  to  the  dry  lithium  nitrate  and, 
magnetic  stirring  was  started.  The  syringe,  dropping  funnel,  and  con¬ 
denser  were  washed  with  5  ml  of  nitromethane  into  the  reaction  flask. 

The  flask  was  heated  at  95°C  by  an  oil  bath  for  1.5  hours  and  then 
permitted  to  cool  to  room  temperature.  Dry  sodium  sulfate  was  added 
(4.40  g,  31.0  mmol)  and  the  reaction  mixture  stirred  for  one  addition¬ 
al  hour  in  order  to  neutralize  triflic  acid  formed  concomitantly  with 
nitronium  triflate  as  illustrated  by  the  following  word  equations: 

lithium  nitrate  +  triflic  anhydride  =  nitronium  triflate  +  triflic  acid 
sodium  sulfate  +  triflic  acid  =  sodium  triflate  +  sodium  bisulfate 

Finally,  2.40  ml  of  2-pyrrolidone  (Aldrich  catalog  24,033-8,  d  =  1.120 
2.69  g,  31.6  mmol)  in  5  ml  of  nitromethane  was  added  dropwise  to  the 
stirred  reaction  solution  over  a  30-minute  period.  The  reaction  mix¬ 
ture  became  yellow.  Stirring  of  the  slurry  was  maintained  with  diff¬ 
iculty  until  the  reaction  mixture  was  warmed  to  30-35oC.  In  some  runs 
it  was  necessary  to  add  an  additional  3-5  ml  of  nitromethane  to  assure 
continuous  stirring.  After  24  hours  stirring  was  stopped  and  solids 
separated  by  vacuum  filtration  and  washed  with  2-3  ml  of  nitromethane. 
N'itromethane  filtrates  were  combined  and  added  to  a  separatory  funnel 
where  they  were  extracted  three  times  with  20-ml  portions  of  water. 

The  water  extracts  were  combined  and  washed  three  times  with  20-ml 
portions  of  dichloromethane.  Each  of  the  dichloromethane  extracts 
were  evaporated.  The  first  two  dichloromethane  extracts  gave  1.4  g  of 
2-pyrrolidone  which  was  identified  by  its  ir  and  nmr  spectra.  The 
residue  from  the  third  dichloromethane  extract  was  less  than  0.05  g. 
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The  nitrometbane  filtrate  was  dried  over  magnesium  sulfate  and  the 
solvent  distixxed  under  reduced  pressure  and  the  pumped  on  at  1  mm  to 
remove  residual  solvent.  The  residue  amounted  to  0.93  g  (7.1  mmol) 
which  is  a  46%  yield  based  on  the  amount  of  triflic  anhydride,  the 
most  expensive  reagent,  used.  The  theoretical  yield  of  N-nitro-2-pyr- 
rolidone  is  4.11  g  based  on  2.69  g  of  2-pyrrolidone  used  initially. 
Because  1.4  g  of  2-pyrrolidone  was  recovered,  the  net  2-pyrrolidone 
was  1.3  g  which  gives  a  theoretical  yield  of  1.98  g.  The  percentage 
yield  of  N-nitro-2-pyrrolidone  becomes  47%  based  on  consumed  2-pyrrol¬ 
idone  which  is  remarkably  close  to  46%  based  on  triflic  anhydride  and 
completely  consistent  with  Major  Shackelford's  suggestion  [13]  which 
requires  two  moles  of  triflic  anhydride  per  mole  of  2-pyrrolidone 
rather  than  the  1:1  ratio  that  we  had  assumed  initially. 

V.  ONE-FLASK  NITRONIUM  TRIFLATE  NITRATION  OF  2-PYRROLIDONE  TO 

FORM  N-NITR0-2-PYRR0LID0NE:  After  the  formation  of  nitronium  triflate 
from  lithium  nitrate  and  triflic  anhydride  in  nitromethane  was 

confirmed  by  the  successful  nitration  of  bromobenzene,  the  nitration 
of  2-imidizolidone  was  attempted.  In  Table  V  the  attempts  to  nitrate 
2-imidizolidone  with  nitronium  triflate  are  summarized.  Table  V  gives 
the  amounts  of  reagents  and  a  brief  description  of  the  results  for 
those  experiments  which  were  carried  through  the  workup  procedure.  A 
variety  of  workup  procedures  were  tried  but  none  succeeded  in 

isolating  nitrated  compounds.  Imidizolidone  nitration  experiments 
which  were  abandoned  are  not  included.  The  solvent  for  all 

experiments  was  nitromethane. 

Infrared  and  nmr  spectra  and  melting  points  were  observed  for  all 
solid  products  isolated  in  the  experiments  reported  in  Table  V.  A 
flame  test  on  a  platinum  wire  was  used  to  identify  nonmelting  or 
high-melting  solids  as  lithium  salts.  Lithium  salts  give  an  easily 
identified  red  flame  which  clearly  differentiates  them  from  sodium  or 
potassium  salts.  Most  of  the  products  isolated  were  lithium  salts. 
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TABLE  V.  NITRATION  OF  2-IMIDIZOLIDONE  WITH  NITRONIUII  TRIFLATE 


Run , 
NB 

ID 

Lithium 
Nitrate 
g , ( mmo 1 ) 

Mis  Triflic 
NM  Anhydride 
Solv  g  fmmol) 

Reaction  2- 
Conditions 
time,  temp  g 

Imidizol- 
idone 
( mmo 1 ) 

Comments  and 
other 

information 

• 

1 . 

0.445 

7.5  0.86  ml 

4  hr,  I05OC 

0.522 

14  hr,  rm  temp 

1-44 

(5.11) 

1.44(5.11) 

NT  crystals 

(6.06) 

0.1  g  benzoic 

NaOH  workup 

2. 

0.984 

15  2.00  ml 

4  hr,  1050c 

0.594 

0.1  g  benzoic 

1-50 

(14.1) 

3.35(11.9) 

NT  crystals 

(6.90) 

mp  120-121 

NaOH  workup 

• 

3. 

0.987 

10  1.70  ml 

4  hr,  1050c 

0.457 

0.1  g  benzoic 

1-56 

(14.3) 

2.85(10.1) 

NT  crystals 

(5.30) 

NaOH  workup 

4. 

1.828 

10  3.70  ml 

4  hr  reflux 

0.947 

DCM  extraction 

1-60 

(26.5) 

6.20(22.0) 

NT  crystals 

(11.0) 

Only  salts 
+Li  flame  test 

1 

5. 

0.918 

10  1.90  ml 

4  hr  reflux 

0.885 

CF3SO2  in  ir 

1-65 

(13.3) 

3.18(11.3) 

clear  soln 

(10.2) 

no  NO2  in  ir 
+Li  flame  test 

6. 

0.887 

12  1.80  ml 

2.5  hr  reflux 

0.850 

only  salts,  ir 

1-68 

(12.9) 

3.02(10.7) 

NT  crystals 

(9.9) 

red  flame 

7. 

1.821 

12  2.60 

2.0  hr  reflux 

0.812 

benzoic  by  ir 

1-70 

(26.5) 

4.36(15.5) 

NT  crystals 

(9.4) 

NaOH  workup 

Notes  on  Columns  of  Table  _V 

Run/NB  ID:  Arbitrary  number  and  notebook  page  reference. 

Lithium  Nitrate:  Anhydrous  from  LiN03-3H20;  grams,  millimoles. 

Mis  NM  Solv:  Milliliters  of  Nitromethane  solvent  used. 

Triflic  Anhydride:  Aldrich,  15,853-4;  milliliters,  grams,  millimoles. 
Rxn.  Conditions:  for  forming  Nitronium  Triflate  (NT); 

NT  Crystals  means  that  NT  observed  at  NM  surface. 
2-Imidizolidone  Grams,  millimoles  of  2-1  added  after  NT  formed. 


It  is  difficult  to  see  how  2-imidizolidone  can  be  converted  to  benzoic 
acid.  It  was  observed  in  those  experiments  in  which  sodium  hydroxide 
was  used  in  the  workup  and  also  in  one  experiment  in  which  a  neutral 
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workup  was  used.  The  experimental  procedure  for  one  of  the  experiments 
is  described  in  the  next  paragraph.  In  the  run  described  there  was  an 
excess  of  nitronium  triflate.  The  target  of  the  reaction  was  the  known 
compound  N, M '-dinitro-2-imidazoiidone,  with  melting  point  216-217^C. 

Run  5 ,  Table  V ;  Nitration  of  2-Imidizolidone .  In  a  dry  100-ml  flask 
under  a  nitrogen  atmosphere  were  mixed  0.918  g  (13.3  mmol)  of  lithium 
nitrate,  1.90  ml  (3.18  g,  11.3  mmol)  of  triflic  anhydride,  and  10  ml 
of  nitromethane  solvent.  The  mixture  was  heated  and  magnetically 
stirred  under  reflux  for  four  hours.  During  the  four  hours  crystals 
of  lower  density  than  nitromethane  formed  at  the  air-nitromethane 
interface  and  the  flask  side  above  the  interface.  Lithium  nitrate 
remained  at  the  bottom  until  reacted.  After  nitronium  triflate  was 
formed,  0.885  g  of  2-imidizolidone  (10.2  mmol)  was  added  and  the  flask 
agitated  to  achieve  mixing.  All  crystals  dissolved  within  10  minutes 
except  for  the  small  amount  of  excess  lithium  nitrate  in  the  bottom  of 
the  flask.  Dry  sodium  sulfate  (3.3  g,  23.2  mmol)  was  added  to 
neutralize  triflic  acid.  Solids  were  then  separated  by  filtration. 

The  nitromethane  solution  was  extracted  with  three  10-ml  portions  of 
water.  The  nitromethane  solution  was  then  dried  over  0.5  g  MgS04  and 
the  MgS04  separated  by  filtraton.  Nitromethane  solvent  was  evaporated 
under  reduced  pressure  with  a  rotary  evaporator.  A  yellow  oil/crystal 
mass  remained  which  amounted  to  less  than  0.10  g.  A  few  white  crys¬ 
tals  (designated  I-65A)  were  on  the  upper  portion  of  the  evaporating 
flask.  The  infrared  spectrum  of  U-I-65A  did  not  contain  absorbtions 
characteristic  os  C-Nitro  or  N-Nitro  groups  [very  strong  abssorptions 
at  1570-1540  or  1630-1530  cm-1;  strong  at  1390-1340  or  1315-1260 

cm-1].  On  the  basis  of  the  melting  point  (121oC)  and  infrared  absorp¬ 
tions  at  1604,  1584,  1496,  935,  708  and  685,  sample  I-65A  was  identi¬ 
fied  as  benzoic  acid,  Earlier  in  another  experiment,  a  similar  sample 
was  identified  as  benzoic  acid  by  mass  spectroscopy  performed  at  the 
Seiler  I.dboratory  at  the  U.  S,  Air  Force  Academy. 
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The  magnesium  sulfate  used  for  drying  the  nitromethane  layer  was 
washed  with  5  ml  of  dichloromethane.  The  water  extracts  of  the  nitro¬ 
methane  layer  were  extracted  with  three  10-mi  porticjns  of  dichloro¬ 
methane.  Dichloromethane  extracts  were  combined  and  solvent  removed 
with  a  rotary  evaporator.  The  residue  (designated  I-65B)  amounted  to 
less  than  0.1  g.  Residue  I-65B  was  identified  as  starting  material 
2-imidizolidone  by  its  infrared  and  nmr  spectra. 

The  original  water  extracts  of  the  reaction  solution  were  evaporated 
to  give  more  than  3  g  of  white  and  yellow  crystalline  and  amorphous 
solid  designated  as  1-65C1.  A  portion  of  I-65C1  was  washed  with 
hexane,  air  dried  and  labeled  I-65C2.  Sample  I-65C2  melted  over  a 
wide  range,  most  melting  occurring  at  197-210oC.  The  infrared  spectra 
of  I-65C1  and  I-65C2  were  very  similar  and  had  a  carbonyl  absorption 
at  1689  cm-1  which  was  probably  due  to  benzoic  acid.  A  broad  peak 
occurred  between  3300-3400  cm-1  (OH  or  NH).  A  very  strong  absorption 
at  1292  cm-1  was  consistent  with  the  trif luoromethyl  group.  There 
were  no  absorptions  consistent  with  a  nitration  product.  Vie  concluded 
I-65C1  and  I-65C2  were  essentially  the  same. 

The  nmr  spectrum  of  7  mg  of  I-65C1  in  0.8  ml  deuterium  oxide  showed 
absorptions  at  3.2,  3,3  and  4.5  ppm.  The  peak  at  4.5  is  a  water  peak 
which  is  merged  with  the  N-H  protons  of  2-imidizolidone.  Sample 
I-65C1  in  deuteroacetone  had  proton  absorptions  at  3.48  and  3.68  ppm. 
On  addition  of  authentic  2-imidizolidone  to  the  deuteroacetone  sol¬ 
ution  of  I-65C1  there  were  peaks  at  3.48,  3.68  and  4.36  ppm.  The 
major  organic  substance  is  recovered  2-imidizolidone. 

Additional  work  was  carried  out  on  samples  derived  from  I-65C1.  Many 
infrared  and  nmr  spectra  were  observed.  Melting  points  were  observed 
and  flame  tests  made.  The  conclusion  was  that  most  of  the  material  in 
the  water  layer  was  lithium  salts  contaminated  by  starting  material. 
Certainly  one  of  the  products  was  lithium  trif luoromethy Isulfonate . 
At  no  time  could  evidence  be  found  for  a  nitro  compound. 
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Discussion  of  Reaction  of  2-Imidizolidone  with  Nitronium  Trif late . 


This  discussion  assumes  that  lithium  nitrate  and  triflic  anhydride 
react  to  form  nitroniura  triflate.  It  is  possible  that  nitronium 
triflate  does  nitrate  2-imidizolidone  to  form  N-nitro-2-imidizolidin- 
one  and/or  N,N'-dinitro-2-imidizolidone,  but  if  they  are  formed,  they 
are  not  stable  to  the  reaction  conditions  or  the  conditions  of  work  up. 
The  formation  of  benzoic  acid  is  evidence  of  major  rearrangement.  Suri 
[3]  was  able  to  nitrate  2-imidizolidone  using  nitroniium  trif luoroace- 
tate  formed  from  ammonium  nitrate  and  trif luoroacetic  anhydride  to  give 
N,N'-dinitro-2-imidizolidone.  The  strongest  acid  present  for  Suri  was 
urif luoroacetic  acid.  We  have  the  very  strong  triflic  acid  present 
which  may  cause  our  difficulties. 

Beilstein  reports  that  2-imidizolidone  is  dinitrated  by  nitric  acid  to 
form  N,N'-dinitro-2-pyrrolidone.  This  dinitro  compound  undergoes 
hydolysis  in  base  to  form  N,N'-dinitroethylenediamine.  We  did  not  see 
evidence  for  this  compound  in  our  basic  workups  of  reaction  mixtures. 

Our  failures  may  be  due  to  the  possibility  of  0-nitration  as  well  as 
N-nitration.  Or  it  may  be  due  to  triflic  acid  present  in  the  in  situ 
nitrations.  It  is  apparent  that  we  must  isolate  nitronium  triflate 
and  use  it  as  a  nitrating  agent  under  neutral  conditions  or  under 
weakly  acidic  or  basic  solutions. 

VI.  ONE-FLASK  NITRONIUM  TRIFLATE  NITRATION  OF  PYRROLIDINE 

Attempts  were  made  to  nitrate  pyrrolidine  using  the  techniques 

developed  for  bromobenzene  and  2-pyrrolidone.  Suri  [3]  synthesized 
N-nitropyrrolidine  using  nitronium  trif luoroacetate  to  nitrate 

pyrrolidine.  He  reported  the  infrared  spectrum  of  this  compound. 
Four  of  our  experiments  are  summarized  in  Table  VI.  In  none  of  these 
experiments  did  we  find  evidence  for  a  nitro  compound. 
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TABLE  V.  NITRATION  OF  2-IMIDIZOLIDONE  WITH  NITRONIUM  TRIFLATE 


Run , 

Lithium 
Nitrate 
g , ( mmo 1 ) 

Mis 

NM 

S  Iv 

Trif lie 
Anhydride 
g  (mmol) 

Reaction 
Conditions 
time,  temp 

Pyrrolidine 

g  (mmol) 

Comments  and 
other 

information 

1. 

2.02 

10 

4.94 

1  hr 

1.36 

K3PO4  as  base 

(29.3) 

(17.5) 

reflux 

(19.2) 

no  products 

2 . 

1.13 

10 

3.69 

1  hr 

0.68 

K3PO4  as  base 

(16.4) 

(13.1) 

reflux 

(9.6) 

no  products 

3. 

1.57 

10 

6.03 

1  hr 

1.53 

K3PO4  as  base 

(22.8) 

(21.4) 

reflux 

(21.6) 

no  products 

4. 

2.3Q 

10 

5.37 

1  hr 

1.53 

K3PO4  as  base 

(33.4) 

(19.0) 

reflux 

121.6) 

no  products 

Notes  on  Columns  o£  Table  ^ 

Run/NB  ID:  Arbitrary  number. 

Lithium  Nitrate:  Anhydrous  from  LiN03*3H20;  grams,  millimoles. 

Mis  NM  Solv:  Milliliters  of  Nitromethane  solvent  used. 

Triflic  Anhydride:  Aldrich,  15,853-4;  milliliters,  grams,  millimoles. 
Rxn.  Conditions:  for  forming  Nitroniura  Triflate  (NT). 

Pyrrolidine:  Grams,  millimoles  of  P  added  after  NT  formed. 

VII.  SIGNIFICANT  FINDINGS  AND  CONCLUSIONS.  A  new  method  for 

synthesis  of  nitronium  triflate  has  been  developed.  Lithium  nitrate 
reacts  with  triflic  anhydride  in  refluxing  nitromethane  in  one  hour. 

We  have  found  that  nitronium  triflate  sublimes  and  can  be  isolated  but 
have  not  used  isolated  nitronium  triflate  for  nitration  reactions.  Our 
objectives  have  been  in  situ  nitration.  We  have  succeeded  only  in  ni¬ 
trating  bromobenzene  and  2-pyrrolidone.  Mixed  ortho-  and  para-bromoni- 
trobenzene  was  obtained  in  75%  yield;  N-nitro-2-pyrrolidone  was  ob¬ 
tained  in  46%  yield.  No  nitro  products  were  obtained  from  pyrrolidine 
or  2-imidizolidone .  We  did  not  achieve  the  goal  of  this  study,  the  in 
situ  nitration  of  nitrogen  heterocyclic  compounds.  We  believe  this  is 
due  to  the  presence  of  an  equivalent  of  triflic  acid  that  accompanies 
nitronium  triflate  and  the  presence  of  unreacted  triflic  anhydride. 
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Nitronium  triflate  can  be  isolated  by  sublimation  from  the  reaction 
flask  in  which  it  is  formed.  The  next  investigation  should  b^  the  ■'  e 
of  isolated  nitronium  triflate  in  neutral  solvents  with  a  ua£.e  present 
to  pick  up  triflic  acid  as  it  forms. 

VIII.  RECOMMENDATIONS  BASED  ON  SIGNIFICANT  FINDINGS  AND  t:ONCLUSIONS . 
No  further  investigations  of  in  situ  nitration  by  nitronium  triflate 
should  be  made.  Instead,  an  experimental  technique  for  isolating 
nitronium  triflate  should  be  accomplished.  Nitronium  triflate  should 
then  be  used  in  neutral  solvents  such  as  dichloromethane  in  the 
presence  of  a  base  such  as  sodium  sulfate  for  the  nitration  of 
difficultly  nitrate  organic  compounds. 

IX.  REFERENCES.  References  follow  on  the  next  two  pages. 
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This  experimental  research  program  focuses  on  the  active  control  of  dynamic 
stall  phenomena.  Previous  studies  of  constantly  pitched  and  oscillating  airfoils  have 
concluded  that  significant  enhancements  of  lift  occur  prior  to  vortex  manifestation 
on  the  airfoil  surface.  A  logical  subsequent  step  is  to  evaluate  means  of  controlling 
the  flow,  delaying  vortex  formation,  and  extending  enhanced  aerodynamic  force 
capabilities.  This  study  focuses  upon  identifying  control  parameters  necessary  to 
actively  force  the  unsteady  flow  over  a  NACA  0015  airfoil  pitched  at  constant  rates. 
The  characteristic  frequencies  of  the  specific  flows  are  first  determined,  and  then  a 
flow  visualization  analysis  is  performed  on  the  unsteady  flow  field  under  natural  and 
controlled  conditions.  Results  show  that  forcing  at  the  subharmonic  flow 
frequencies  provides  the  greatest  reduction  in  the  upper  surface  separation  region 
area  and  the  longest  delay  in  dynamic  stall  vortex  formation.  These  results  indicate 
that  extensions  of  high  lift  would  be  probable  under  similar  active  forcing 
conditions. 
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1:  INTRODUCTION 


The  ability  of  high-performance  aircraft  to  perform  in  combat  situations  is 
governed  by  aircraft  maneuverability  and  stability.  Design  changes  in  air  to  air 
weapons  have  increased  the  need  for  fighter— aircraft  to  execute  increasingly  difficult 
maneuvers,  in  which  the  best  possible  instantaneous  turning  performance  is  essential 
to  combat  superiority.  This  specification  has  lead  to  an  interest  in  aircraft  lift 
enhancement  and  stability  control. 

The  control  systems  on  air  to  air  combat  weapons  have  attained  all— aspect 
capabilities.  This  allows  them  to  move  in  conjunction  with  aircraft  maneuvers  and 
to  couple  firing  systems  with  flight  control  systems.  Because  of  these  all— aspect 
weapons,  combat  effectiveness  has  become  more  dependent  upon  the  unsteady 
performance  of  the  aircraft,  namely  the  ability  to  execute  increasingly  difficult 
maneuvers  in  which  the  best  possible  turning  ratio  is  essential  and  to  maintain  pilot 
control  necessary  for  weapon  aiming  accuracy  and  stall  prevention.  The  need  for 
high  turning  rates  result  in  flight  at  high  angles  of  attack,  which  nominally  leads  to 
separation  on  the  wing.  This  separation  can  lead  to  increased  drag,  wing  buffeting, 
and  stability  and  control  problems  which  degrade  combat  effectiveness  and  could 
lead  to  loss  of  aircraft  (Whitford,  1987). 

The  leading  edge  vortex  which  forms  over  an  airfoil  or  wing  during  a 
constant  or  oscillating  pitching  motion  has  been  shown  to  be  connected  to  aircraft 
performance.  Figure  1,  taken  from  flow  visualization  results  of  Walker  and  Chou 
(1987),  shows  the  characteristics  of  such  a  vortex.  Visually,  the  dynamic  stall 
vortex  begin  as  a  small  leading  edge  separation  bubble  and  grows  with  angle  of 
attack  until  it  detaches  from  the  upper  airfoil  surface.  This  vortex  is  associated 
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with  large  decreases  in  upper  surface  pressure  values  and  increased  pressure 
gradients.  The  presence  of  the  vortex  on  the  upper  airfoil  or  wing  surface  is 
preceded  by  enhanced  values  of  lift  and  drag.  It  is  necessary  to  examine  the  events 
leading  up  to  vortex  formation  in  order  to  determine  the  nature  of  the  flow 
conditions  which  create  the  increased  aerodynamic  forces.  Knowing  these 
conditions,  it  should  be  possible  to  actively  control  the  flow  and  maintain  enhanced 
force  characteristics. 

Although  only  a  few  active  control  methods  have  been  specifically  developed 
for  the  complicated  mechanisms  of  unsteady  separation  and  dynamic  stall 
phenomena,  numerous  control  techniques  have  been  explored  in  static  airfoil  cases 
which  hold  promise.  Significant  advancements  have  been  achieved  in  the  area  of 
acoustic  control  of  flow  over  static  airfoils,  step  flows,  and  mixing  layers  and  wakes. 
Employing  both  internal  and  external  acoustic  forcing,  researchers  have  been 
successful  in  both  reducing  the  separated  region  over  the  airfoil  and  delaying  stall  in 
conjunction  with  enhanced  lift  coefficients.  Although  it  is  not  practical  to  install  an 
acoustic  generator  on  a  aircraft,  there  is  sufficient  engine  bleed  air  from  most 
conventional  aircraft  to  supply  an  alternating  air  blowing  and  suction  system.  It 
has  been  shown  that  the  mechanisms  of  acoustic  control  are  similar  to  air 
injection/suction  (Williams,  1991).  Both  spanwise  and  tangential  blowing  have 
been  employed  successfully  to  delay  leading  edge  vortex  growth  on  delta  wings  and 
thus  extend  the  regime  of  stable,  controlled  flow  over  the  upper  wing  surface. 

The  previous  approaches  to  static  airfoil  control  can  be  extended  to  flow 
about  unsteady  airfoils  with  modifications.  The  flow  over  an  airfoil  pitching  at 
constant  rates  is  comparable  to  static  flow  prior  to  leading  edge  separation  (Carr, 
1988).  Thus,  it  is  reasonable  to  assume  that  the  established  techniques  currently 
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used  to  control  free  and  bounded  shear  flows  should  be  applicable  here. 
Gad-d— Hak  (1986)  noted  that  changing  the  leading  edge  flow  characteristics  should 
lead  to  changes  in  the  entire  flow  field.  This  study  focuses  on  the  determination  of 
the  characteristic  flow  frequencies  associated  with  the  Kelvin— Helmholtz 
instabilities  for  constantly  pitched  airfoils.  This  knowledge  will  then  be  used  to 
explore  the  benefits  of  external  acoustic  forcing  of  the  flow.  The  qualitative  and 
quantitative  information  produced  by  this  study  provide  important  insight  on  both 
the  mechanisms  leading  to  the  lift  enhancement  characteristic  to  such  flows  and  also 
the  means  by  which  to  manipulate  these  events  to  improve  airfoil  performance. 


2:  SUBJECT  OF  INVESTIGATION  AND  MAJOR  OBJECTIVES 

Although  significantly  more  energetic  and  complex,  the  unsteady  flow  around 
a  constantly  pitched,  two-dimensional  airfoil  in  part  bears  resemblance  to  free  shear 
and  reattaching  separated  flows  such  as  mixing  layers  and  flows  over  downstream 
facing  steps.  In  the  step  flow,  since  the  step  height  is  large  compared  with  the 
upstream  boundary  layer  thickness,  the  flow  field  immediately  downstream  of  the 
step  can  be  considered  a  boundary  layer  in  transition  to  a  free  mixing  layer  (Troutt, 
Scheelke,  and  Norman,  1984).  Similarly,  the  flow  over  a  pitching  airfoil  can  be 
initially  associated  vrith  a  free  shear  layer,  and  as  si  .  is  susceptible  to  small 
perturbations  via  the  Kelvin— Helmholtz  instabilities  (Ho  and  Hnerre,  1984). 

A  significant  amount  of  work  has  been  accomplished  which  links  the  initial 
Kelvin— Helmholtz  instability  frequency  with  active  forcing  of  flow  fields.  The 
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ability  to  apply  the  successful  static  studies  to  the  dynamic  airfoil  flow  rests  upon 
two  recent  experimental  determinations: 

1)  The  unsteady  separated  region  which  occurs  prior  to  the  dynamic  stall  vortex 
formation  is  similar  to  the  separation  region  occurring  over  a  static  airfoil 
and  other  less  physically  similar  flows  (such  as  the  flow  over  a  downstream 
facing  step,  etc)  (Gad-el— Hak  and  Mangaiam,  1991). 

2)  The  lift  enhancement  associated  with  dynamic  flows  embodies  itself  prior  to 
the  development  of  the  dynamic  stall  vortex,  and  thus  the  initial  unsteady 
separation  region  is  what  must  be  focused  upon  when  evaluating  active 
controls  (Albertson,  Trontt,  and  Kedzie,  1988). 

This  research  study  involves  the  evaluation  and  the  development  of  acoustic 
and  tangential  pulsed  air  blowing  control  methods  for  a  constantly  pitched  NAG  A 
0015  airfoil.  These  applied  methods  are  then  analyzed  using  flow  visualization.  A 
key  step  in  the  utilization  of  these  methods  is  the  accurate  determination  of  the 
characteristic  frequencies  associated  with  the  Kelvin— Helmholtz  instabilities  on  both 
a  static  and  a  dynamic  airfoil.  This  is  accomplished  by  visual  observation  of  the 
instability  waves  and  by  the  use  of  hot-film  anemometry  to  observe  the  associated 
doppler-like  pulse  created  by  the  instabilities.  This  study  covers  the  low  range  of 
non-dimensional  pitch  rates,  from  0.01  to  0.15,  since  the  slower  moving  airfoil 
bears  an  even  greater  resemblance  to  the  static  cases,  and  this  realm  is  where  the 
greatest  difference  in  aerodynamic  force  coefficients  occurs. 
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The  primary  objective  of  the  proposed  experiments  is  thus  to  determine  the 
characteristic  flow  frequencies  that  are  coupled  with  dynamic  stall  flows  and  to 
establish  that  the  associated  energetic  flows  can  be  effectively  controlled  by  proven 
active  means.  Once  the  key  forcing  parameters  are  identified,  the  actively  modified 
flow  behavior  can  be  evaluated  and  conclusions  can  be  drawn  concerning  the 
benefits  of  such  forcing  and  subsequent  evaluations  can  be  made  to  generalize  the 
results  to  new  designs  or  techniques  for  practical  applications. 


3:  SUMMARY  OF  PREVIOUS  WORK 

Recent  significant  advances  in  air  combat  capabilities  have  shifted  the  focus 
of  fighter  aircraft  design  from  performance  specifications  (energy)  to  extended 
maneuverability  capabilities.  The  development  of  all-aspect  short-range  infrared 
guided  missiles  and  all— aspect  capability  guns  coupled  with  aircraft  position 
controls,  the  emphasis  in  close  air  combat  design  has  shifted  to  enhanced 
maneuverability.  (Herbst,  1983a  &  1983b).  When  airfoils  or  wings  are  pitched 
rapidly  beyond  their  static-stall  angle  of  attack,(  the  unsteady  motion)  results  in  a 
delay  of  stall  and  aerodynamic  force  significantly  greater  than  the  static  stall  case. 
The  ensuing  complex,  unsteady  flow  process  is  characterized  in  general  as  dynamic 
stall,  and  has  been  the  subject  of  numerous  investigations.(McCroskey,  1982,  & 
Carr,  1988) 

Prior  to  embarking  on  any  analysis  of  the  flow  events  leading  up  to  and 
including  the  development  of  dynamic  stall,  a  comprehensive  review  of  research 
results  to  date  is  necessary.  The  interest  in  forced,  unsteady,  separated  flows 
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evolved  from  the  first  studies  of  dynamic  overshoot  in  lift  on  helicopter  rotors  (Ham 
and  Gaidick,  1968)  to  current  investigations  in  two  areas,  low  amplitude  oscillating 
airfoils  aud  wings  and  high  amplitude,  high  rate  constant  pitching  motions.  For  the 
purposes  of  this  experimental  study,  only  constant  rate  two-dimensional  motions 
are  being  considered.  The  reasoning  for  this  focus  is  specified  later. 

The  interest  in  unsteady  separated  flows  lies  in  the  large  transient 
aerodynamic  forces  they  generate.  The  focus  of  early  investigations  into 
two-dimensional,  unsteady  separation  focused  upon  the  inherent  presence  of  the 
dynamic  stall  vortex,  the  leading  edge  vortical  structure  which  is  characteristic  of 
these  flows.  Such  flows  are  characterized  by  a  non-dimensional  pitch  rate,  a"*", 
defined  as  cic/U^,  where  a  is  the  actual  pitch  rate,  c  is  the  airfoil  chord,  and  is 
the  free  stream  velocity.  Cook  (1987)  showed  that,  for  inertia  force  dominated 
flows,  the  non-dimensional  pitch  rate  could  provide  effects  that  are  orders  of 
magnitude  greater  than  the  Reynolds  number  effects.  Walker  and  Chou  (1987) 
substantiated  this  analysis  through  experimentation. 

Walker  and  Chou  (1987)  obtained  a  detailed  qualitative  sequence  of  events 
leading  up  to  dynamic  stall  through  smoke  wire  flow  visualization  of  the  flow. 
Illustrated  by  the  author  in  Figure  1,  dynamic  stall  vortex  development  is  shown  for 
an  of  0.1,  with  the  airfoil  pivot  point  located  at  0.25c.  1(a):  The  flow  transitions 
from  quasi-steady  flow  with  symmetric  alternating  vortices  shed  into  the  wake  to 
an  asymmetric,  unsteady  flow.  1(b):  The  upper  surface  separation  zone  moves 
forward  from  the  trailing  edge,  accompanied  by  an  increase  in  relative  strength  of 
the  trailing  edge  counter-clockwise  vortices.  1(c):  As  the  separation  zone  grows,  a 
shear  layer  interaction  between  the  inviscid  outer  flow  and  the  viscous  boundary 
layer  produces  a  recirculation  region  where  a  number  of  small  clockwise  vortices 
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appear.  These  clockwise  vortices  are  similar  to  structures  caused  by 
Kelvin— Helmholtz  instabilities  in  free  shear  flows.  1(d):  The  shear  vortices  are 
absorbed  into  a  turbulent  region  which  has  formed  and  moved  forward  from  the 
trailing  edge.  A  leading  edge  separation  bubble  has  formed  near  the  leading  edge. 
1(e):  The  leading  edge  bubble  manifests  itself  in  the  form  of  a  clockwise  vortex,  the 
dynamic  stall  vortex,  and  grows  on  a  scale  comparable  to  the  airfoil  chord.  1(f): 
The  dynamic  stall  vortex  has  detached  from  the  airfoil  surface  and  convects 
downstream  perpendicular  to  the  airfoil  chord.  A  trailing  edge  vortex  forms  on  the 
trailing  edge. 

The  development  of  the  unsteady  separation  region  over  a  constantly  pitched 
airfoil  and  the  growth  of  the  dynamic  stall  vortex  results  in  extensive  variations  in 
the  upper  surface  pressure  field.  Frands  and  Keesee  (1985)  conducted  an  analysis  of 
the  pressure  field  on  a  constantly  pitched  airfoil.  They  found  that,  although  there  is 
significant  dependence  on  non-dimensional  pitch  rate,  the  qualitative  features  of  the 
surface  pressure  field  are  similar  in  various  dynamic  stall  cases.  At  attack  angles 
lower  than  the  corresponding  static  stall  values,  the  configuration  of  the  pressure 
distributions  resembles  quasisteady  (static)  flow,  except  for  a  lag  in  pressure 
coefiicient  magnitudes.  As  the  angle  of  attack  passes  the  static  stall  point,  the 
pressure  distribution  for  an  attached  flow  persists,  again  vnth  pressure  coeffiaent 
magnitudes  in  excess  of  steady  flow  values.  The  suction  peak  detaches  from  the 
leading  edge  region  and  moves  downstream  along  the  suction  surface  as  the  angle  of 
attack  continues  to  increase.  Comparing  the  behavior  of  the  upper  surface  pressure 
field  with  flow  visualization  data  by  Walker,  HeUn,  and  Strickland  (1985)  reveals 
that  the  suction  peak  position  corresponds  to  vortex  position  over  the  airfoil.  This 
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was  confirmed  by  Albertson,  et  al  (1987)  in  a  digital  image  analysis  study  tracking 
vortex  movement  over  the  airfoil. 

The  manifestation  of  these  increased  suction  magnitudes  for  unsteady  airfoil 
flow  results  in  accentuated  lift  and  drag  values  as  compared  to  static  cases.  Lift 
coefficients  more  than  double  the  static  stall  values  have  been  determined  for 
non-dimensional  pitch  rates  as  low  as  0  03  (Albertson,  Trontt,  and  Kedzie,  1988). 
Drag  coefficient  values  were  found  to  increase  correspondingly,  although  analysis  of 
the  lift  to  drag  ratios  showed  airfoil  performance  enhancements  well  above  static 
airfoil  experiments. 

The  non-dimensional  pitch  rate,o^,  has  been  identified  numerous  times  as  a 
characteristic  descriptive  parameter  for  dynamic  stall  flows.  The  variation  in  the 
flow  field  vdth  respect  to  can  be  categorized  into  three  areas,  flow  visualization, 
surface  pressure  measurements,  and  surface  velocity  field  measurements.  In  a 
comprehensive  flow  visualization  analysis.  Walker,  Hdin,  and  Strickland  (1985) 
found  that  increasing  from  0.2  to  0.6  both  delayed  flow  separation  and  dynamic 
stall  vortex  formation.  In  addition,  at  the  higher  value,  significant  secondary 
vortical  structures  appear.  Albertson,  et  al  (1987)  used  digital  image  analysis  to 
quantify  the  growth  and  movement  of  the  leading  edge  vortex  with  respect  to  angle 
of  attack.  They  determined  that,  for  both  a**"  values  of  0.1  and  0.2,  the  dynamic 
stall  vortex  experiences  a  short  period  of  slow  growth,  suggesting  a  quasi^table 
period,  followed  by  rapid  growth  and  eventual  vortex  detachment.  The  angle  of 
attack  at  which  the  vortex  commences  enhanced  growth  coincides  closely  to  the 
point  when  dynamic  stall  occurs.  It  was  also  ascertained  that  the  commencement  of 
rapid  vortex  growth  coincides  with  the  attack  angle  at  which  the  vortex  center  is 
located  approximately  over  the  quarter  chord  position.  Jumper,  Schreck,  and 
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Dimmick  (1985)  suggested  that  a  plateauing  in  their  lift  curves  with  respect  to  angle 
of  attack  corresponded  with  the  quarter  chord  separation  and  the  subsequent 
dynamic  stall  vortex  detachment  from  the  airfoil  surface.  These  results  indicate 
that  there  indeed  is  a  connection  between  the  quarter  chord  separation  point  and 
the  overall  aerodynamic  performance  of  the  airfoil  which  bears  further  investigation. 

The  effect  of  non-dimensional  pitch  rate  change  on  the  upper  surface 
pressure  field  Las  been  the  subject  of  several  pressure  surface  studies.  Experimental 
analyses  performed  by  Walker,  Helin,  and  Chon  (1985)  indicated  that  lift 
enhancement  associated  with  the  unsteady  airfoil  motion  magnifies  with  increasing 
pitch  rate.  Observing  the  flow  over  a  range  from  =  0.05  to  0.6,  they  determined 
that  both  the  magnitude  of  maximum  lift  and  the  angle  of  attack  at  which  it  occurs 
(dynamic  stall  angle)  increase  with  a*^.  As  expected,  an  increase  in  the  initial  slope 
of  the  lift  coefficient  curve  accompanies  these  trends.  Francis  and  Keesee  (1985) 
showed  that  the  relationship  between  the  maximum  lift  coefficient  and  over  a 
pitch  rate  range  from  zero  to  0.4  follows  the  same  trend  as  the  attack  angle  at  which 
dynamic  stall  occurs.  Strickland  and  Graham  (1986)  introduced  a  stall  delay  angle, 
defined  as: 

stall  =  dyn.  stall  “  “static  stall 

where  is  the  angle  of  attack  corresponding  to  dynamic  stall  and 

“static  stall  angle  of  attack  corresponding  to  static  stall.  The  occurrence  of 
dynamic  stall  is  based  upon  the  occurrence  of  leading  edge  separation.  They  found 
that  the  stall  delay  angle  at  the  nose,  Aojyj  is  proportional  to  the  square  root 
of  the  non-dimensional  pitch  rate.  Based  upon  research  that  the  maximum  lift 
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coefficient  follows  the  same  trend  as  the  pitch  angle  corresponding  to  dynamic  stall, 
it  can  be  deduced  that  Cj^  should  follow  the  same  trend.  This  has  not  been 
analyzed. 

The  connection  between  the  surface  velocity  field  over  a  constantly  pitched 
airfoil  and  the  non-dimensional  pitch  rate  was  investigated  by  Walker,  Helin,  and 
Strickland  (1985)  and  by  Walker  and  Chou  (1987).  Walker,  Helin,  and  Strickland 
discovered  reverse  flow  velocities  directly  under  the  dynamic  stall  vortex  to  be  over 
140%  U  for  an  a'*'  of  0.2  and  over  210%  U  for  an  of  0.6.  It  was  also  noted 


(D  CD 

that  the  leading  edge  vortex  initially  produces  a  much  higher  reverse  flow  velocity 
than  the  trailing  edge  vortex  does.  Walker  and  Chon  confirmed  these  results,  and 
additionally  disclosed  that  the  upper  surface  velocity  reaches  a  sub— peak 
corresponding  to  the  point  of  dynamic  stall  vortex  initiation,  and  then  resumes  an 
increasing  trend  rapidly  to  the  primary  velocity  peak. 

It  has  been  demonstrated  that  the  airfoil  pivot  location  has  a  substantial 
effect  on  the  unsteady  flow  field  development.  Helin  and  Walker  (1985)  found  that 
as  the  pivot  point  moves  downstream  along  the  chord,  the  onset  of  dynamic  stall  is 
delayed.  The  subsequent  rapid  development  and  movement  of  the  vortex  over  the 
airfoil  has  no  appreciable  qualitative  variation  with  pivot  location. 

Stephen,  et  al  (1989)  confirmed  that  the  flow  develops  independently  of  pitch 
axis  location  after  the  leading  edge  vortex  forms.  In  their  comparative  study  of  a 
two-dimensional  NACA  0015  airfoil  pitched  about  axes  from  half  a  chord  length 
forward  of  the  airfoil  to  half  a  chord  length  aft  of  the  airfoil,  they  determined  that 
the  size  of  the  vortical  disturbances  are  similar.  This  indicates  that  the  vortex 
strength  may  be  comparable  in  each  case.  Although  moving  the  pitch  axes  aft  does 
delay  the  flow  development,  the  magnitude  of  the  pressure  peak  and  thus  of  the 
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aerodynamic  pressure  forces  decreases.  In  both  the  non-dimensional  pitch  rates 
studied,  =  0.1  and  15,  the  delay  due  to  an  increase  in  was  greater  than  the 
delay  due  to  changing  pitch  axis.  The  trade  off  between  dynamic  stall  delay  and 
enhanced  airfoil  performance  has  not  been  thoroughly  investigated  to  date. 

Although  the  two-dimensional  mixing  layer  is  in  many  aspects  dissimilar 
from  a  two-dimensional  unsteady  airfoil  flow,  the  fundamental  growth  mechanisms 
are  analogous.  As  described  at  the  beginning  of  this  section,  the  first  indication  of 
disturbances  in  the  viscous  flow  around  a  NACA  0015  airfoil  pitched  at  a  constant 
rate  is  when  flow  reversals  appear  near  the  surface  at  the  rear  of  the  airfoil.  This 
reversal  moves  up  the  airfoil  surface,  and  large  eddies  materialize  in  the  interface 
(shear)  layer  between  the  invisdd  free  stream  and  the  viscous  boundary  layer.  It  is 
these  eddies,  originated  by  the  Kelvin— Hdmholtz  instabilities,  which  can  be  seen  to 
combine  and  contribute  to  the  growth  of  the  unsteady  separation  region  and  the 
subsequent  dynamic  stall  vortex  growth  (Albertson,  1989). 

The  plane  mixing  layer  was  first  shown  to  contain  large  scale  structures  by 
the  experiments  of  Brown  and  Roshko  (1974).  Winant  and  Browand  (1974)  added 
that  the  growth  of  the  mixing  layer  was  due  to  the  propagation  of  the  instability 
waves,  which  then  roll  up  into  discrete  two-dimensional  vortex  structures.  The 
:bsequential  growth  of  the  mixing  layer  is  due  to  the  interaction  between  these 
large  scale  vortex  structures,  known  as  "pairing".  Since  this  two-dimensional 
mixing  layer  is  significantly  simpler  than  bounded  flows,  a  great  deal  of  analysis  has 
been  accomplished  to  identify  the  means  with  which  to  control  the  growth  and 
behavior  of  the  large  scale  structures.  The  initial  instability  frequency  has  been 
shown  to  fluctuate  in  time  by  as  much  as  10%,  and  there  is  actually  a  broad  band  of 
frequencies  present  (Browand,  1986) 
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It  has  been  conclusively  demonstrated  that  the  large  scale  vortex  structures 
present  in  the  mixing  layer  can  be  modified  by  external  forcing.  Ho  and  Huang 
(1982)  and  Oster  and  Wygnanski  (1982)  showed  that  the  pairing  interactions 
between  the  large  scale  structures  (and  thus  mixing  layer  growth)  can  be  modified 
by  introducing  coherent  perturbations  at  the  initiation  of  mixing.  By  controlling 
the  pairing  interactions,  the  spreading  rate  of  the  mixing  layer  can  be  enhanced  or 
inhibited,  in  some  extents  even  changing  the  signs  of  the  Re3molds  stress  values  and 
reducing  turbulent  energy.  Ho  and  Huang  specifically  determined  that  the 
mechanisms  by  which  vortex  pairing  was  controlled  depended  upon  the  frequency  at 
which  the  flow  was  controlled. 

The  methods  of  vortex  pairing  control  have  been  extended  to  include  the 
shear  layer  of  a  reattaching  separated  flow.  Trontt,  Schedhe,  and  Norman  (1984) 
verified  that  large  scale  structures  similar  to  those  present  in  the  mixing  layer  were 
also  characteristic  of  the  reattaching  separated  flow  over  a  downstream  facing  step. 
Bhattacharjee,  Schedke,  and  Trontt  (1986)  and  Rooe  and  Kegdman  (1986)  explored 
control  of  the  flow  over  a  downstream  facing  step  using  two  separate  control 
mechanisms.  Bhattacharjee,  Scheelke,  and  Trontt  (1986)  used  a  hot-wire  probe  to 
ascertain  the  characteristic  frequency  associated  with  the  Kelvin-Helmholtz 
instabilities.  Power  spectra  of  the  signal  yielded  a  broad  peak  that  gradually  shifted 
towards  lower  frequencies  with  downstream  position.  This  shift  is  accredited  to  the 
large-scale  vortex  amalgamations  occurring  in  the  separated  shear  layer.  Forcing  at 
the  characteristic  initial  vortex  passage  produces  a  sharp  spike  in  the  power  spectra 
at  the  natural  flow  frequency.  This  coincides  with  increased  temporal  and  spatial 
correlation  in  the  spanwise  flow  and  a  considerable  reduction  in  reattachment 
length.  Forcing  between  Strouhal  numbers  of  0.2  and  0.4  is  the  most  effective 


forcing  range  over  a  large  range  of  Reynolds  number.  Strouhal  number,  St,  is 
defined  as  the  frequency,  multiplied  by  the  characteristic  length  and  divided  by  the 
free  stream  velocity.  Roos  and  Kegdman  (1986)  confirmed  these  results  using  an 
oscillating  flap  at  the  step  edge  to  excite  the  flow. 

Several  experimental  analyses  have  been  performed  upon  static  airfoils  under 
varying  control  conditions.  Ahnja  and  Bunin  (1984)  used  high  frequency  external 
acoustic  control  over  a  cambered  airfoil  to  improve  lift  coefficients  over  50%  greater 
than  natural  conditions.  Lift  enhancement  is  highly  dependent  upon  both  frequency 
and  amplitude,  with  the  greatest  lift  increase  occurring  near  665  Hz.  This  frequency 
is  directly  proportional  to  the  characteristic  flow  frequency. 

By  internally  injecting  acoustic  forcing  near  the  characteristic  flow 
frequency,  Collins  (1981)  was  able  to  partially  reattach  the  upper  surface  separation 
and  increase  lift  coefficients  by  20%.  Maestiello  (1986)  confirmed  this  with  external 
acoustic  forcing,  and  in  addition  determined  that  velocity  perturbation  magnitudes 
in  the  region  of  transition  could  be  reduced  significantly. 

Zaman,  Bar— Sever,  and  Mangalam  (1987)  were  able  to  remove  laminar 
separation  completely  by  low  frequency  (St  <  5)  external  acoustic  oscillations  over  a 
smooth  airfoil.  In  addition,  lift  enhancements  were  achieved  with  large  amplitude, 
high  frequency  excitation  (St  =  4—25)  in  the  post-stall  regime.  Tunnel 
cross— resonances  induce  large  transverse  velocity  fluctuations  near  the  airfoil  that 
enhance  the  separation  control.  However,  these  fluctuations  would  not  be  present  in 
the  open  flow  over  an  aircraft  in  actual  flight.  The  authors  suggest  that  the 
excitation  mechanisms  which  produce  reduced  separation  must  hinge  on  the 
instability  of  the  separated  shear  layer,  but  must  also  be  influenced  by  the  presence 
of  the  solid  boundary  and  the  separation  location.  Huang,  Maestrello,  and  Bryant 
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(1987)  discovered  that  the  shear  layer  was  extremely  sensitive  to  sound  excitation  in 
the  vicinity  of  the  separation  point.  In  their  study  of  internal  acoustic  excitation  of 
a  static  airfoil,  they  also  verified  that  the  most  beneficial  forcing  frequencies  are 
those  of  the  instability  waves.  Forcing  at  the  fiindamental  shedding  frequency  or  its 
harmonic  increases  entrainment  in  the  early  part  of  the  shear  layer  and  drastically 
reduces  the  extent  of  separation. 

The  strong  coupling  between  the  injected  sound  and  the  shear  layer 
instability  frequency  was  confirmed  by  Huang,  Maestrello,  and  Bryant  (1987). 
They  were  able  to  reduce  the  region  of  separation  over  a  symmetrical  airfoil  by 
injecting  sound  through  the  leading  edge  of  the  airfoil.  At  the  same  time,  there  is 
increased  entrainment  on  the  early  part  of  the  shear  layer  and  increased  circulation. 
The  stall  angle  of  attack  is  delayed  and  lift  magnitude  is  enhanced  significantly  for 
extended  attack  angles. 

Spa^iwise  blowing  has  been  successfully  employed  by  numerous  researchers  to 
take  advantage  of  the  enhanced  lift  created  in  unsteady  flows.  The  initial  interest 
in  this  particular  method  of  control  arose  from  the  need  to  manipulate  the  flow  over 
static  delta  wings,  which  are  characterized  by  continuing  leading  edge  vortex  growth 
and  breakuown.  Both  Bradley  and  Wray  (1974)  and  Campbell  (1976)  found  that 
blowing  a  stream  of  high-pressure  air  over  a  wing  surface,  parallel  to  the  leading 
edge,  delayed  leading  edge  vortex  growth  and  thus  the  deleterious  effects  of  vortex 
breakdown  at  the  higher  angles  of  attack.  This  delay  is  accompanied  by  lift 
increases  of  up  to  50%  over  uncontrolled  flows.  The  delay  of  vortex  breakdown 
postpones  static  stall  of  the  lifting  surface  as  well.  Seginer  and  Salomon  (1986)  used 
spanwise  blowing  over  a  canard-wing  configuration  for  static  angles  of  attack  to 
augment  both  lift  and  lift-to-drag  ratios  and  delay  static  stall. 
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Using  a  slightly  different  means,  Roberts,  et  al  (1985)  and  Wood  and  Roberts 
(1988)  used  tangential  mass  injection  through  a  slot  along  the  leading  edge  of  a 
static  delta  wing.  Direct  control  of  the  primary  separation  allows  significant  control 
of  the  vortex  flow  up  to  sixty  degrees  angle  of  attack.  The  primary  effect  of 
tangential  leading  edge  blowing  is  to  reduces  the  strength  of  the  vortical  flow, 
resulting  in  an  extended  regime  of  stable,  controlled  vortical  flow  over  the  upper 
surface  of  the  wing.  In  addition,  the  separation  line  relocates  to  an  in  board 
position  under  the  forcing. 

Successful  attempts  have  been  made  to  control  unsteady  flows  such  as  the 
one  discussed  here.  Carr  and  McAlister  (1983)  used  a  leading  edge  slat  on  an 
oscillating  airfoil  to  produce  a  flow  that  remains  attached  to  the  airfoil  for  angles  of 
attack  well  above  those  characteristic  to  the  natural  flow.  The  djmamic  stall  was 
significantly  delayed,  while  at  the  same  time  the  severity  of  the  stall  was  reduced. 
Luttges,  Robinson,  and  Kennedy  (1985)  were'  able  to  obtain  similar  results  by 
introducing  single  air  pulses  through  a  two-dimensional  slot  located  at  0.2c  on  an 
oscillating  NACA  0015  surface.  The  flow  control  was  affective  only  when  the  pulse 
corresponded  to  periods  of  high  shear  and  large  accumulations  of  vortidty. 

The  experiments  reviewed  here  have  revealed  control  techniques 
which  are  proven  to  reduce  separation,  delay  static  stall,  and  enhance  lift  over 
airfoils  and  wings.  The  current  experimental  analysis  seeks  to  adapt  the 
demonstrated  methods  of  controlling  shear  flows  to  active  forcing  of  dynamic  stall 
flows.  Active  control  of  the  flow  over  a  constantly  pitching  airfoil  should  result  in 
initial  reduction  in  the  unsteady  separation  region,  dynamic  stall  delay,  and 
extensions  of  the  high-lift  envelope. 
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4:  DESCRIPTION  OF  RESEARCH  PROCEDURES 


The  experiments  discussed  here  were  conducted  in  the  Frank  J.  Seiler 
Research  Laboratory  open  return,  low  speed  wind  tunnel  at  the  U.  S.  Air  Force 
Academy.  The  wind  tunnel  has  a  3.0  ft  x  3.0  ft  test  section.  A  NACA  0015  airfoil 
with  a  6  in  chord  and  a  2  ft  span  was  pitched  at  constant  rates  from  0°  to  50°  about 
the  0.25c  pitch  axis.  The  four  non-dimensional  pitch  rates,  of  0.01,  0.05,  0.1, 
and  0.15  were  applied.  The  airfoil  motion  was  accomplished  using  a  stepper  motor 
assembly  controlled  by  a  MassComp  5500  microcomputer  system. 

The  flow  was  visualized  by  introducing  smoke  using  a  smoke  wire.  The 
smoke  wire  technique  used  by  Hdin  and  Walker  (1985)  was  employed.  Theatrical 
fog  fluid  was  applied  to  a  0.005  in  tungsten  wire,  leaving  fine  droplets  almost 
uniformly  along  the  wire.  A  current  applied  to  the  wire  evaporated  the  oil, 
producing  fine  streaklines  across  the  test  section. 

The  visibility  of  the  smoke  depends  upon  the  amount  of  light  being  scattered 
by  the  smoke  particles  themselves.  Maximum  efGdency  results  when  the  smoke  is 
illuminated  by  direct  light.  If  other  parts  of  the  test  section  are  illuminated  with 
too  great  a  light  level,  the  background  reflections  may  overpower  the  smoke.  The 
lighting  for  the  vnnd  tunnel  was  provided  by  synchronized  strobe  lights.  The  high 
intensity  arc-lamp  strobe  lights  are  synchronized  with  a  35mm  still  camera  and  a 
16mm  high-speed  camera  to  illuminate  the  streakline  flow.  The  strobe  lights  have 
an  average  7/iS  flash  duration,  which  froze  the  flow. 

The  characteristic  flow  frequencies  were  obtained  using  two  separate 
methods.  To  ascertain  a  preliminary  frequency  of  the  Kelvin— Helmholtz  instability 
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waves,  results  from  high-speed  film  at  the  four  specified  non-dimensional  pitch 
rates  was  analyzed.  The  waves  were  visually  counted  with  repeatable  results. 
These  frequencies  were  then  verified  by  placing  a  hot— film  probe  near  the  surface  of 
the  flow  and  evaluating  the  signal  through  power  spectra  methods. 


5:  EXPERIMENTAL  RESULTS 


The  first  segment  of  the  research  project  was  to  develop  concise  means  of 
acoustic  and  tangential— pulsed  air  forcing  applicable  to  a  NACA  0015  airfoil  under 
constant  pitch  motions.  The  tangential— pulsed  air  system  is  shown  in  Figure  2. 
Three  spanwise,  two  dimensional  slots  1/16  "  wide  and  18"  long  are  placed  in  the 
airfoil  at  the  leading  edge,  20%  chord,  and  40%  chord.  The  slots  are  configured  so 
that  the  exiting  air  follows  the  airfoil  surface  for  a  period  of  time,  creating  control 
along  the  surface  at  the  slot.  The  slots  are  supplied  with  air  on  each  end.  A  timing 
circuit  translates  a  signal  from  a  square  wave  generator  and  simultaneously  pulses 
six  valves  at  a  specified  frequency  between  zero  and  sixty  hertz.  The  control  system 
allows  either  coincident  pulsing  through  each  slot  or  forcing  from  any  given  slot 
combination.  A  flow  visualization  study  was  performed  to  verify  that  the  presence 
of  the  slots  themselves  did  not  significantly  alter  the  flow. 

The  acoustic  forcing  system  is  shown  in  Figure  3.  A  15",  220  watt  woofer  is 
placed  in  an  enclosure  so  that  the  optimum  sound  intensity  and  quality  exits 
through  a  2.25"  hole  in  the  top  of  the  casing.  An  additional  sleeve  padded  with 
acoustic  foam  surrounds  the  speaker  enclosure  to  dampen  out  resonant  frequencies 
from  the  box  itself.  Sound  is  supplied  to  the  test  section  through  a  series  of  rigid 
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pipes,  which  permit  transmission  of  a  low  noise  signal  in  a  non-4ntrusive  manner. 
The  acoustic  sound  is  focused  on  the  leading  portion  of  the  airfoil,  since  forcing 
nearest  to  the  origination  of  the  instability  waves  yields  the  greatest  effect.  The 
speaker  is  controlled  with  a  sine  wave  generator  to  produce  frequencies  as  low  as  15 
hertz.  The  signal  at  frequencies  lower  than  15  hertz  is  inconsistent,  and  control 
required  at  these  levels  was  provided  by  the  pulsed  air  system.  An  attempt  was 
made  to  acoustically  force  the  flow  internally  through  the  slots  designed  for  pulsed 
air  blowing.  Unfortunately,  there  was  a  problem  with  the  transmission  of  the 
acoustic  pressure  wave,  and  sufGdent  acoustic  amplitude  did  not  reach  the  slots. 

The  next  step  in  the  experiment  was  to  ascertain  the  characteristic 
frequencies  associated  with  the  Kelvin— Helmholtz  instabilities.  These  frequencies 
were  first  visually  measured  from  high  speed  films  of  the  streak-line  flow  for  the 
four  different  non-dimensional  pitch  rates.  Although  not  a  completely  accurate 
method  of  frequency  determination,  this  procedure  allowed  an  initial  estimate  that 
aided  in  interpretation  of  the  more  accurate  hot— film  data.  By  placing  a  hot— film 
probe  just  outside  the  airfoil  boundary  layer,  a  clear  trace  of  the  instability  induced 
waves  was  obtained.  As  expected,  these  traces  resemble  the  form  of  a  Doppler 
burst.  Figure  4  shows  examples  of  the  natural  signal,  the  signal  acoustically  forced 
at  the  characteristic  frequency,  and  subsequently  at  the  first  subharmonic  of  the 
flow.  The  hot— film  signals  were  first  quantified  using  a  storage  oscilloscope,  and 
subsequently  using  a  power  spectra  analysis. 

Examples  of  power  spectra  for  the  static  case  at  an  angle  of  attack  of  five 
degrees  and  the  dynamic  cases  of  a"*"  =  0.01  and  0.05  are  given  in  Figure  5.  The 
free  stream  velocity  for  these  cases  was  25  ft/sec.  Three  dominant  frequencies  are 
apparent  in  the  static  case  at  20  Hz,  10  Hz,  and  5  Hz.  As  the  20  Hz  signal  is  the 
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most  powerful,  it  indicates  the  natural  frequency  associated  with  the 
Kelvin— Helmholtz  instability  waves.  The  peaks  and  5  Hz  and  10  Hz  show  that 
pairing  of  the  instability  waves  is  occurring,  resulting  in  the  emergence  of  the 
subharmonics.  The  non-dimensional  pitch  rates  of  oi^  =  0.01  and  0.05  yieli  strong 
peaks  near  50  Hz,  specifically  at  52  Hz  and  56  Hz,  respectively. 

Flow  visualization  at  a  non-dimensional  pitch  rate  of  0.01  is  shown  in  Figure 
6  for  the  non— forced  (natural)  case  and  for  the  flow  externally  forced  at  the 
characteristic  frequency  of  15  Hz.  Since  the  low  freestream  velocity  of  10  ft/sec 
required  for  flow  visualization  did  not  allow  accurate  hot— film  measurements,  the 
characteristic  instability  frequencies  were  determined  from  high-speed  film.  It 
should  be  noted  that  the  natural  frequency  of  the  flow  decreased  with  freestream 
velocity.  At  attack  angles  lower  than  a  —  20“,  there  are  no  discernible  differences 
between  the  forced  and  natural  flow.  The  two  cases,  columns  one  and  two,  differ 
significantly  at  this  angle  of  attack.  There  is  a  loosely  defined  leading  edge,  or 
dynamic  stall,  vortex  present  on  the  upper  airfoil  surface  for  both  the  natural  and 
forced  flow.  However,  the  vortex  on  the  natural  airfoil  covers  a  larger  portion  of  the 
upper  airfoil  surface  than  in  the  controlled  case. 

The  differences  between  the  flows  have  diminished  by  o  =  25°,  but  it  is  still 
clear  that  the  natural  flow  contains  traces  of  large  scale  structures.  Off  the  trailing 
edge,  an  upward  trend  in  the  flow  coincides  with  a  faint  trailing  edge  vortex, 
characteristic  of  dynamic  stall  flows.  There  is  also  a  less-defined  vortex  near  the 
leading  edge.  The  controlled  flow  shows  no  evidence  of  such  structures.  By 
a  =  30°,  the  two  flows  again  appear  similar. 
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A  flow  visualization  study  was  also  conducted  at  a  pitch  rate  of  0.05  using 
external  acoustic  forcing.  Figure  7  shows  both  the  natural  case  in  column  one  and 
the  forced  case,  33  Hz,  in  column  two.  As  at  an  of  0.01,  the  effects  of  the 
acoustic  forcing  are  not  evident  prior  to  an  angle  of  attack  of  20°.  At  this  attack 
angle,  a  slight  disruption  occurs  in  the  forced  alternating  vortices  shed  off  the 
trailing  edge.  The  apparent  accelerated  mixing  of  the  flow  continues  to  an  attack 
angle  of  25°,  where  the  acoustically  forced  case  shows  a  separated  region  that  is 
slightly  smaller  than  the  than  the  natural  case.  At  a  =  30°,  a  dynamic  stall  vortex 
is  clearly  developing  on  the  natural  airfoil,  indicated  by  the  region  void  of  smoke 
near  the  leading  edge.  Although  there  is  a  similar  void  region  near  the  leading  edge 
of  the  forced  airfoil,  it  is  not  as  developed.  In  addition,  the  width  of  the  upper 
surface  separation  region  is  reduced  in  the  controlled  case.  By  a  =  35°  the  two 
flows  are  similar. 

It  is  forcing  at  the  subharmonic  frequency  that  has  been  most  beneficial  in 
controlling  flows  such  as  a  wake  or  a  mixing  layer.  In  these  flows,  controlling  the 
flow  at  the  subharmonic  frequency  enhances  vortex  pairing  and  delays  separation. 
It  is  reasonable  to  assume  that  forcing  at  the  subharmonic  frequency  of  an  unsteady 
flow  will  yield  similar  results.  Figure  8  shows  the  flow  for  several  angles  of  attack 
under  natural  and  external  subharmonic  control.  The  two  flows  are  similar  prior  to 
a  =  25°.  At  that  attack  angle,  the  controlled  flow  appears  slightly  more  slightly 
more  turbulent,  especially  near  the  trailing  edge.  In  addition,  there  is  no  evidence 
of  the  depressed  area  of  the  separation  region  midway  along  the  airfoil  that  indicates 
dynamic  stall  vortex  formation  in  the  natural  case.  At  a  =  30°,  there  is  still  no  sign 
of  a  dynamic  stall  vortex  in  the  forced  flow,  while  one  is  clearly  present  on  the 
natural  airfoil.  In  addition,  the  overall  width  of  the  separation  region  near  the 
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airfoil  leading  edge  is  reduced  by  approximately  half  in  the  controlled  flow.  By 
a  =  35°,  a  dynamic  stall  vortex  has  formed  on  the  forced  airfoil  as  well  and  there 
are  no  discernible  differences  between  the  two  cases. 


6:  CONCLUSIONS 

The  analysis  of  acoustically  and  tangentially  controlled  flow  over  a  airfoil 
pitching  at  the  constant  non-dimensional  rates  of  0.01,  0.05,  0.1,  and  0.15  has 
yielded  the  following  conclusions: 

1.  Forcing  the  flow  at  the  characteristic  frequency  delays  dynamic  stall  vortex 
growth,  and  at  the  same  time  results  in  a  decreased  separation  region  over 
the  upper  airfoil  surface. 

2.  Forcing  the  flow  at  the  first  subharmonic  also  delays  dynamic  stall  vortex 
formation  and  growth,  and  in  addition  causes  a  greater  reduction  in  the 
separation  region  over  the  surface  than  the  case  forced  at  the  characteristic 
frequency. 

3.  The  natural  instability  frequencies  of  the  flow  are  not  only  dependent  upon 
non-dimensional  pitch  rate,  but  also  on  free-stream  velocity. 

From  these  results,  it  can  be  expected  that,  by  actively  forcing  the  flow  at 
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the  first  subb.irmonic  or  lower  subharmonics,  enhanced  lift  can  be  maintained  for  a 
longer  period  of  time. 


7:  RECOMMENDATIONS 

Based  upon  the  results  obtained  in  this  study,  the  following 
recommendations  for  extended  research  are  o^^'-'red- 

1)  The  study  of  the  Kelvin- Helmholtz  insta''‘’;ity  requency  should  be  expanded 
to  include  the  effects  of  changing  free  stream  velocity  and  actual  pitch  rate. 

2)  A  thorougx.  pressure  representation  of  each  case  is  imperative  to  assess  the 
practical  benefits  of  the  active  control. 

3)  The  information  gained  from  active  control  studies  of  these  flows  should  be 
extended  to  include  interactive  forcing  and  Jrfoil  motion  controls,  to  allow  a 


maximization  of  the  benefits  of  active  control. 
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FIGURE  3. 


External  Acoustic  Control  System 


FREQUENCY 


AMPLITLiDE 


5(c):  Dynamic  Power  Spectra,  *  0.05 


FIGURE  5.  Power  Spectra  for  hot-film  data,  static  and  dynamic  airfoil 
flow. 
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Figure  6 .  Flow  v' sualization  at  d."  for  natural  flow  (0  Hz) 

and  flow  forced  at  the  characteristic  frequency  (15  Hz). 


I 


0  HZ 


33  HZ 


Figure  8*  Flow  visualization  at  ol”  =  0.05  for  natural  flow  (0  Hz’) 
and  flow  forced  at  the  characteristic  frequency  (33  Hz). 
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CHAPTER  1 


Introduction 

The  investigation  is  broken  into  two  parts.  First,  the  modeling  and  analysis  are 
carried  out  to  study  the  system  dynamics.  The  system  considered  is  a  cantilever  beam 
with  a  structure-borne  reaction-mass  actuator  (RMA)  attached  at  the  free  end.  The 
applied  forcing  function  is  arbitrary.  The  governing  partial  differential  equation  of 
motion  of  the  Euler-Bemoulli  beam  in  transverse  vibration  is  derived.  The  RMA  is 
modeled  as  two-point  concentrated  masses  connected  by  a  spring  and  a  viscous  damper. 
The  undamped  natural  frequencies  and  the  corresponding  normalized  mode  shapes  are 
determined  by  solving  the  eigenvalue  problem  where  the  RMA  (undamped)  is  considered 
as  a  part  of  the  boundary  condition.  The  forced  responses  for  both  the  undamped  and 
damped  cases  are  obtained  by  means  of  modal  analysis.  Both  frequency  response  and 
transient  response  are  considered.  The  system  under  consideration  is  classified  as 
passive  vibration  control  whose  vibration  suppression  capability  is  limited  compared  to 
active  vibration  control. 

In  the  second  part,  a  feedback  control  system  is  designed  to  suppress  the  vibration  in 
an  active  vibration  control  fashion.  The  control  system  design  is  based  on  modem 
control  theory  where  state  feedback,  output  feedback,  pole  placement,  performance 
index,  etc.  are  considered.  Important  aspects  of  control  system  design  such  as  stability, 
performance,  controllability,  observability,  reconstructability,  etc.  are  studied. 

The  control  design  is  broken  into  three  independent  steps.  The  first  step  is  a  regulator 
design  in  which  the  complete  state  is  assumed  available  for  feedback.  A  performance 
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index  is  defined  for  the  design  goal.  By  minimizing  the  performance  index,  the 
optimal  linear  quadratic  Gaussian  regulator  (LQR)  can  be  found.  Usually  the 
assumption  of  complete-state  feedback  is  not  practical,  an  estimator  has  to  be 
used  to  reconstruct  the  state  in  this  case.  This  is  the  second  step.  The  last  step 
is  to  connect  the  regulator  and  estimator  to  complete  the  design  (LQG  design). 
The  complete  system  is  called  output  feedback  control  system. 

Chapter  2  is  devoted  to  free  vibration  problem.  A  frequency  equation  is  ob¬ 
tained  by  solving  the  eigenvalue  problem.  Because  it  is  a  transcendental  equation, 
a  numerical  algorithm  is  employed  to  calculate  the  natural  frequencies  which  are 
the  roots  of  the  frequency  equation.  Each  natural  frequency  is  related  to  a  vibra¬ 
tion  mode  which  has  a  particular  mode  shape.  The  orthogonality  derived  shows 
that  each  mode  shape  is  orthogonal  to  the  others. 

The  forced  response  is  calculated  in  Chapter  3.  Lagrange’s  equation,  which 
takes  the  approach  of  energy  consideration,  is  used  to  derive  the  differential  equa¬ 
tion  of  motion.  By  making  use  of  the  assumption  of  modal  analysis  and  the 
or-thogonality,  a  set  of  uncoupled  ordinary  differential  equations  arc  obtained. 
The  solution  of  the  forced  response  is  in  analytic  form.  Modal  analysis  approach 
is  also  applied  to  the  damped  system  to  have  a  set  of  coupled  equations.  Howev'r, 
an  analytic  solution  is  not  available  to  these  equations.  If  we  put  these  equations 
in  state-space  form,  a  software  package  can  offer  the  numerical  solution.  The 
state-space  equation  is  also  convenient  for  control  analysis.  But  modal  analy¬ 
sis  transforms  the  continuous  system  into  an  infinite-degree-of-freedom  system 
in  terms  of  the  generalized  coordinates,  which  actually  can  not  be  measured  for 
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feedback.  This  problem  cam  be  solved  by  using  a  transformation  matri.x  which 
converts  the  equation  back  to  the  physical  coordinate  system.  The  physical  co¬ 
ordinates  are  chosen  at  positions  along  the  beam  where  the  vibration  needs  to 
be  suppressed.  The  state-space  equation  is  nondimensionalized  for  numerical  cal¬ 
culations.  Several  common  parameters  are  defined.  Case  study  is  at  the  end  of 
Chapter  3. 

Chapter  4  is  focused  on  the  regulator  design.  The  state-space  differential 
equation  derived  is  the  basic  model  for  control  theory  analysis.  The  dynamics  of 
the  reaction  mass  actuator  (RMA)  is  discussed.  The  stability  and  controllability 
also  have  been  analyzed  to  make  sure  the  design  is  possible.  In  the  fourth  section 
of  this  chapter,  optimal  feedback  gains  are  calculated.  The  sensitivity  analysis 
shows  that  the  control  system  has  enough  ability  to  compensate  the  parameter 
changes  in  the  system  modeling. 

In  Chapter  5,  a  full-order  estimator  is  studied.  The  purpose  of  introducing 
the  estimator  into  the  control  loop  is  to  reconstruct  the  state  of  the  system  when 
the  complete  state  feedback  is  not  available.  Also,  the  estimator  offers  certain  de¬ 
gree  of  filtering  of  the  noise  which  always  exists  in  the  practical  implementation. 
The  reconstructability  guarantees  that  the  estimator  design  is  possible.  The  es¬ 
timator  feedback  gains  are  calculated.  The  output  feedback  control  system  is  set 
up  by  combining  the  regulator  and  estimator.  Th  characteristics  of  the  complete 
system  totally  depends  on  the  regulator  and  estimator.  Therefore,  satisfactory 
response  should  be  expected  if  regulator  and  estimator  are  well  designed. 
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Case  study  is  based  on  a  simple  model.  First  two  modes  are  included  in  this 
model  which  requires  the  state  feedback  from  two  positions  along  the  beam.  We 
choose  X  =  0.5L  and  i  =  L  for  sensors’  positions.  The  numerical  results  are 
presented  in  the  case  study.  A  FORTRAN  code  is  written  and  used  to  solve  for 
the  natural  frequencies  and  calculate  the  coefficients  of  the  differential  equations. 
A  software  package,  MATRIXx,  is  used  to  calculate  the  system  response  and 
analyze  the  control  system.  The  FORTRAN  and  MATRIXx  programs  are  listed 
in  the  .appendix. 
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CHAPTER  2 


Modeling  of  Cantilever  Beam-RMA  System 

Frequency  Equation  and  Natural  Frequencies 

The  problem  of  determining  the  natural  frequencies  of  a  system  is  called 
the  eigenvalue  problem.  For  transverse  vibrations  of  a  beam  (Fig.  2.1),  Euler- 
Bemoulli’s  model  is  used  to  obtain  the  governing  partial  differential  equation.  In 
the  eigenvalue  problem,  the  damping  and  external  forces,  /(x,0  and  F{t).  are 
set  to  zero. 


The  Euler-Bernoulli’s  beam  equation  is  derived  by  considering  an  infinitesi¬ 
mal  element  dx  of  a  beam  and  the  internal  shear  forces  and  moments  acting  on  it. 
The  coordinates  and  free-body  diagram  are  shown  in  Fig.  2.2.  The  beam  theory 
provides  us  with  the  following  relations: 

d^wlx,  t) 

M>,  =  EIix)—^^  (2-la) 
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nx.o  =  -^ 


£/(x) 


d^w{x,  t) 


y 


(2-lb) 


The  mcLss  of  the  beam  element  is  pAdx  and  the  acceleration  of  the  element  is 
d'wjdt^.  Applying  Newton’s  second  law,  we  have 


^  Fj,  =  pAdx 


d^w 

'W 

dV  d^w 


Combining  Eqs.  2-1  and  2-3,  we  obtain 


52 

ai‘ 


EI{x) 


a^u;(i,  t) 

dx^ 


+  pA{x) 


d^w(x.t) 

~~W~ 


=  0 


If  the  beam  has  uniform  cross  section,  Eq.  2-4  reduces  to 

d^xii{xA)  pAd^w{x,i) 


dx^ 


-H 


EI  dt^ 


=  0 


(2-2) 

(2-3) 


(2-4) 


(2-5) 


Euler-Bernoulli’s  beam  equation,  Eq.  2-5,  describes  the  transverse  vibrations  of 
a  continuous  beam. 

The  boundary  conditions  at  i  =  0  au-e  easily  obtained  as 

u;(0,O  =  0  (2-6) 
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Figure  2.3  Free-Body  Diagram  at  x=L 


dw{Q,  t) 
dx 


(2-7) 


The  RMA  in  this  system  can  be  modelled  as  two  point-masses  connected  by  a 
spring.  The  free-body  diagram  at  i  =  Z-  is  shown  in  Fig.  2.3.  Because  the 
moment  at  x  =  Z  is  zero,  by  using  Eq.  2-1  we  have 


d^iv{L,  t) 
dx'^ 


=  0 


(2-S) 


Applying  Newton’s  law  to  both  masses,  A/  and  A/a,  we  obtain 


A/ 


AZ 


d-w{L,i) 

dt^ 

d'^Wait) 


=  -V{L,t)-F{t) 
=  F(0 


(2-9) 


(2-10) 


The  shear  force  V{L,t)  is  equal  to  —EId^w{L,t)/dx^  in  Eq.  2-1  and  the  spring 
force  F{t)  is  equal  to  l:[u;(Z,  t)  —  u;a(01-  Substituting  these  two  relations  into  Eqs. 
2-9  and  2-10,  we  get 
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d^w{L,t) 

dx^ 

“  ^2 


=  k[w{L,t)  -  w;a(0] 


(2-11) 


(2-12) 


Eqs.  2-6,  2-7,  2-S,  2-11,  and  2-12  are  the  five  boundary  conditions  of  the  system 
which  czui  be  used  with  the  governing  equation,  Eq.  2-5  to  solve  the  eigenv^ue 
problem. 

The  modal  analysis  approach  assumes  that  the  solution  of  Eq.  2-5  is  ex¬ 
pressed  as 


w{x,t)  =  (2-13a) 

w^{t)  =  (2-13b) 


where  <t>{x)  and  03,  together,  are  the  mode  shape  (also  called  mode  function)  of 
the  system  which  will  be  determined  in  the  third  section  of  this  chapter. 

Differentiating  Eq.  2- 13a  with  respect  to  x  and  t,  we  obtain  the  following 
derivatives: 


d*wix,  t) 
dx^ 

d'^w{x,t) 

^2 


dx* 


<p  (xje-' 


Introducing  Eqs.  2-14  and  2-15  into  Eq.  2-5,  we  have 


0""(i) 


pA  2 

- (jJ 

El 


<f>{x)  =  0 


Let  us  define  a  constant  A  as  follows 


A-* 


_  2 
“  El'^ 


(2-14) 


(2-15) 


(2-16) 


(2-17) 
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Later  in  this  section,  we  will  see  that  A  is  actually  the  eigenvcdue  of  the  system. 
Using  Eq.  2-17,  Eq.  2-16  can  be  written  as 

4>""{x)-X*<i>{x)  =  Q  (2-lS) 

By  using  modal  analysis  approach,  we  have  simplified  the  partial  differential  equa¬ 
tion,  Eq.  2-5,  into  a  fourth-order  ordinary  differential  equation,  Eq.  2-18,  which 
has  the  general  solution: 

<b{x)  =  AiCoshXx  -h  .42CosA2r  -I-  AzsinhXx  -f  ^445:''  \x  (‘--lO) 

Also,  substituting  Eq.  2-13  into  the  boundary  conditions,  Eqs.  2-6,  2-7,  2-S,  2-11. 
and  2-12  and  canceling  the  term  ,  we  obtaun 


0 

11 

0 

(2-20) 

0 

II 

o' 

(2-21) 

0"(I)  =  O 

(2-22) 

EI<p"\L)  =  -A/u;V(T)  +  k[S{L)  -  <Pa] 

(2-23) 

=  k[<t){L)  -  0ai 

(2-24) 

The  important  relation  between  <pa  and  <p{L)  can  be  derived  directly  from  Eq. 
2-24,  which  is 


0a  = 


k 

k  -  A/,u;‘ 


4>{L)  = 


ul  -u;2 


(2-25) 


where  u;*  =  k/Ma-  Substituting  Eq.  2-25  into  2-23,  we  get 

4>'"{L)  =  [mu;"  d-Afa-;  — -J  0(L)  (2-26) 

ni  u;*  —  J 

The  coefficient  term  on  the  right-hand  side  of  Eq.  2-26  can  be  arranged  into 
dimensionless  form.  Let  us  introduce  the  following  dimensionless  parameters: 


# 


# 


1 
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a  =  ‘  (mass  ratio  of  M  to  the  beam  mass  ) 

pAL 

p.  —  (mass  ratio  of  the  point -mzisses) 

A/ 


fa  = 


(tuning  ratio  of  the  RMA) 


(2-27a) 

(2-2Tb) 

(2-27c) 


Eq.  2-26  is  arranged  into  the  following  form 


^  ^  ^  El  pAL^  pAL  A/  ^  ^  ^ 

OAi.-*  fAL* 


(2-2S) 


By  substituting  the  dimensionless  parameters,  Eq.  2-27,  into  Eqs.  2-2S  and  2-25. 


we  have 


=  [7F(aIFJ 


(2-29) 


(2-30) 


The  four  coefficients  ,Ai,A2,A.  A4  in  Eq.  2-19  are  to  be  solved  by  using  Eqs. 
2-20,  2-21,  2-22,  and  2-20. 

Differentiating  Eq.  2-19  three  times  with  respect  to  x,  we  have 

(p{x)  =  AicoshXx  +  A2COSA1  +  AjsinhXx  -t-  A^sinXx  (2-19) 

d>'{x)  =  X{AisinhXx  —  A2-sinAx  +  A^coshXx  -f-  A4C0SAX)  (2-31) 

<p"{x)  =  X^{AiCoshXx  —  A2C05AX  +  AzsinhXx  —  A^sinXx)  (2-32) 

4>"'{x)  =  X^{AisinhXx  -f-  A^sinXx  -b  A^coshXx  —  A^cosAx)  (2-33) 

Applying  Eqs.  2-20  and  2-21  to  Eq.  2-19  and  2-31,  respectively,  we  get 


Aj  +  A2  —  0,  A2  —  ~Ai 

A3  -b  A4  =  0,  A4  =  —  .-13 


(2-34) 

(2-35) 
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Applying  Eqs.  2-22  amd  2-29  to  Eq.  2-32  aind  2-33,  respectively,  we  have 

(2-36) 

(2-37) 

Let  us  define 


A^(-4iCoshAIr  —  A2C0SXL  -r  A:isinh\L  —  A^sinXL)  =  0 
X^{A\sinhXL  -h  A2sinXL  -f-  AzcoshXL  —  A4C0SXL)  = 


(AX)-* 


a;[l  -f 


/; 


{A\coshXL  -h  A2C0SXL  -|-  A^sinhXL  A^sinXL) 


C  =  coshXL  c  =  cosXL 
S  =  sinhXL  s  =  sinXL 


A  =  a{XL) 


l  +  /i 


/, 


Pa-i^Ly\ 


Introducing  these  symbols  into  Eqs.  2-36  and  2-37  with  Eqs.  2-34  and  2-35,  we 
obtain 


(C  +  c)Ai  -r  (S  5)A3  =  0  (2-3S) 

{S  —  5)Ai  -t  (C  -f  c)A2  =  — >t((C  —  c).4i  +  {S  —  s).43)  (2-39) 

Rearranging  Eqs.  2-3S  and  2-39.  a  set  of  equations  will  be  obtained  to  be  solved 
for  A I  and  A3. 


(C  +  c) 

(5  +  3) 

A,' 

'o' 

_(5-s)4- A(C-c) 

(C  +  c)  + A(5-s) 

.  •'^3  _ 

0 

(2-40) 


In  order  to  get  non-trivial  solutions,  the  determinant  of  the  coefficient  matri.x  in 
Eq.  2-40  has  to  be  equal  to  zero,  i.  e.. 


(C  +  c)  (S  +  s) 

{S-s)^A(C-c)  (C-hc)-(- A(S-s) 


(2-41) 
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Expanding  the  determinant  we  get 

(C^  +  2Cc  +  c")  +  A{CS  —  Cs  +  cS  —  cs)— 

(S2  _  s^)  +  AiCS  +  Cs-cS-cs)  =  Q 

By  using  the  relations,  —  5^  =  1  and  =  1,  the  above  equation  can  be 

written  as 

Cc+l->l(sC-c5)  =  0  (2-42) 


Substituting  the  original  terms  into  Eq.  2-40  and  dividing  Eq.  2-24  by  Cc.  we 
obtain 


1  + 


1 


coshXLcosXL 


-a{XL) 


l+fi- 


fa 


(tanXL  —  tanhXL)  =  0  (2-43) 


The  above  equation  is  the  frequency  equation  of  the  system  which  can  be  solved 
for  the  eigenvalue,  A  .  There  is  an  infinite  number  of  roots  satisfying  this  equation 
and  each  A„(n  =  1,2,...,  oo)  is  related  to  a  vibration  mode  of  the  system  where 
the  corresponding  natural  frequency  is 


Un  =  (A„I) 


2  /  El 


n  =  1, . . . ,  CO 


(2-44) 


pAL* 

If  fa  =  0,  i.e.,  the  stiffness  of  the  RMA  is  reduced  to  zero,  then  the  system 
(Fig.  2.3)  becomes  a  system  shown  in  Fig.  2.4.  The  frequency  equation  for  this 
system  can  be  derived  eaisily  from  Eq.  2-43  by  substituting  /„  =  0. 

1 


or 


1  4- 


1  + 


coshXLcosXL 

1  AIuj^ 


—  a{XL){tanXL  —  tanhXL)  =  0 


coshXLcosXL  EIX^ 


(tanXL  —  tanhXL)  =  0 


(2-45) 


(2-46) 
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Figure  2.4  Cantilever  Beam  with  Point-Mass  M  only 


Solving  the  Frequency  Equation 


From  the  result  of  the  preceding  section,  we  know  that  the  natural  frequencies 
of  the  system  are  obtained  by  solving  the  frequency  equation.  Eq.  2-43. 


tAt, 


'  +  cosiarcosXL  -  ['  +  ° 


Eq.  2-43  is  a  transcendental  equation.  It  is  impossible  to  solve  this  equation 
analyticly.  A  numerical  method  has  to  be  employed  to  solve  for  the  roots.  An 
algorithm  called  Modified  Linear  Interpolation  Method  [l]  is  used  for  solving  this 


equation. 


Before  writing  the  FORTRAN  computer  code,  we  have  to  arrange  the  equa¬ 
tion  to  avoid  the  arithmetic  overflow  problem.  Multiplying  Eq.  2-43  by 

COS  XL  * 

(Air  +  1 


we  obtain 


+  ^  coshXL^ 

a{\L)  ^ [\L)^  ^  {sinXL  —  cosXLtanhXL)  =  0 


(2-47) 
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The  program  is  listed  in  the  Appendix.  The  transcendental  equation  2-47  is 
programmed  in  the  FUNCTION  subroutine.  For  a  different  equation,  only  this 
subroutine  needs  to  be  modified.  Also,  the  program  has  loops  for  different  cases 
of  tuning  in  the  RMA  which  are  set  in  the  program  constants  FMIN,  DF,  and 
FMAX.  The  output  of  the  program  are  the  eigenvalues  mulitplied  by  the  length 
of  the  beam.  The  number  of  eigenvalues  depends  on  the  frequency  scanning  range 
which  is  set  by  the  constants  XMIN,  DX,  and  XMAX.  For  each  eigenvalue,  the 
corresponding  natural  frequency  is  calculated  by  Eg.  2-44. 

If  we  set  the  following  parameters  as 

a  =  0.25,  /i  =  1.0 

XMIN  =  0.001,  DX  =  0.1,  XMAX  =  15. 
fa  =  0,  1.8,  17.6 

the  eigenvaJues  calculated  are 


case 

A 

All 

A^I 

AjI 

A4I 

Asl 

Ael 

1 

0.0 

1.57 

4.23 

7.28 

10.4 

13.5 

— 

2 

l.S 

1.23 

1.72 

4.23 

7.28 

10.4 

13.5 

3 

17.6 

1.42 

3.99 

4.91 

7.30 

10.4 

13.5 

The  first  case  is  /a  =  0  which  means  that  there  is  no  mass  Ma-  This  special  case 
is  the  system  shown  in  Fig.  2-4.  In  the  second  and  third  cases  the  RMA  is  tuned 
to  the  first  and  second  modes  of  the  first  case.  We  can  see  in  the  second  case  that 
the  first  mode  of  case  1  is  splited  into  two  modes  with  little  effect  on  the  others. 
The  third  case  tuning  to  the  second  mode  of  case  1  shows  the  same  phenomenon. 

Orthogonality  and  Normalized  Mode  Shanes 

From  the  first  section  of  this  chapter,  we  see  that  there  are  infinite  vibration 
modes  related  to  a  continuous  system.  In  this  section  we  are  going  to  determine 
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the  mode  shape  for  each  vibration  mode.  Also,  we  are  going  to  show  that  mode 
shapes  satisfy  the  orthogonaJity.  By  using  the  orthogonality,  the  mode  shapes  are 
normalized. 

Based  on  the  assumption  of  modal  analysis  approach,  the  solution  of  a  free 
transverse  vibration  of  the  system  is 

w{x,t)  =  4>{xy'^^  (2-13a) 

u;a(f)  =  (2-13b) 

where  (^(x)  and  <pa  are  the  mode  shapes  and  have  been  obtained  in  the  first  section 
as 


4>{x)  =  AicoshXx  +  AjcosAi  +  AzsinhXx  +  A^sinXx 

J  a 


<Pa  = 


<^(L) 


./a‘-(AI)r 

.A.lso,  from  the  first  section,  we  can  have 


(2-19) 

(2-30) 


-42  =-.4,  (2-34) 

-4^  =  —.43  (2-35) 


And,  Eq.  2-38  can  be  rewritten  as 

,43  coshXL  -I-  cosXL 

_ ±  _ _ ! _  (O.4S) 

Aj  sinhXL  -h  sinXL 

Substituting  these  relations  in  Eq.  2-19  and  2-30,  and  denoting  the  mode 
with  subscript  n,  we  can  have  the  mode  shape. 


<i>r,{x)  =  (2-4Sa) 

—  (^n4>n{Li)  (2-4Sb) 
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V  Here 


hn{x)  =  {cOshXnX  —  COsXri^)  ~ 


coshXnL  +  cosX„L 
sinhXnL  +  sinX„L 


{sinhX„x  —  sinXnx) 


"  '  fl-{KLY 

and  Cn  axe  an-bitrary  constants  to  be  determined  later. 


Ne.xt,  we  will  show  that  each  mode  shape  in  Eq.  2-48  satisfies  the  orthogo¬ 
nality  ocndition.  We  start  to  derive  the  orthogonality  in  a  general  way  (  the  cross 
section  of  the  beam  may  not  be  uniform).  Now,  we  go  back  to  the  differential 
equation  of  the  system  which  is 


f  5^10(1, f)]  d^w(x,t)  ^ 


Substituting  Eq.  2-13a  into  2-4,  we  obtain 


[r-r/  ^  2  X 

3?  =M(XVC(X) 


The  m‘^  and  n‘^  mode  shapes  must  satisfy  Eq.  2-49.  Hece, 


(2-4) 


2-49) 


dx2  dx'^4>rnix) 


=  pA{x)ijl,(t>m{x) 


(2-50) 


(2-51 : 


Multiplying  Eq.  2-50  by  0n(x),  and  integrating  with  respect  to  x  from  0  to 
L,  we  obtain 


P  .  ,  ,cp  r  , 


pA{x)ul^(f>,n{x)(prt{x)dx  (2-52) 
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Integrating  the  left-hand  side  of  Eq.  2-52  by  parts  and  using  the  boundary  con¬ 
ditions  Eqs.  2-20,  2-21,  2-22,  we  get 


dx 


EI{x) 


(P<f)mix) 


d4>nix)  d 


dx 

rL 


EI{x) 


d}4>m{x) 

dx^ 


-i:  ^ 

=  u;^  /  pA{x)(f>m{x)4>n{x)dx  (2-53a) 
Jo 


<Pr,{L)EI{x) 


d^<t>m(L)  d(i)nix) 


dx^  dx 


dx''- 

L 


dx^ 


dz^ 


pA{x)d,n{x)d„{x)dx  (2-53b) 


dz-*  jQ 


i  ^ 

=  ^m  I  pA{x)<pm{x)<Pn{x)dx  (2-53c) 

Jo 


Starting  from  Eq.  2-51  and  repeating  a  similar  derivation,  we  can  ha%-e  the 
following  result: 


<i>m{L)EI 


dUn{L) 

dx^ 


f 


+  /  El 


<f4>n{x)  (P<i>m{x) 
dx^  dz^ 


dx 


=  U}1  pA6mix)<i>n{x)dx 
Jo 

Substracting  Eq.  2-54  from  2-53c,  we  obtain 


(1-54) 


0  =  (‘-'m  f  PM^)<Pmix)<p„ix)dx  +  EI{l)[(i>rr,{L)4>'”{L)  -  <t>„{L)6';^^L)] 

Jo 

(2-55) 
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The  relation  between  <p{L)  and  <p"'{L)  is  given  by 


r(L)  =  -j4-:  [a/w^  +  A/,-^^1  *(i) 

LI{x)  I  —uj^ 


(2-26) 


With  this  relation,  the  second  term  on  the  right-hand  side  of  Eq.  2-55  can  be 
arranged  eis  follows 


Ei{x)[4>rn{L)6'::{L)-4>niL)<i>'::iL)] 
=  M{ul^-ul)  + - 

=  iS-  -  u;- )  [  V/  -f  \L - ^ 


e 

3 

1 

[ 

-J  1 

i  E 

1  3 

1 

LJ 

- — — ) 

o  o  / 


dm(  L')Qn  (-^  ) 


(2-56) 


Substituting  this  result  back  to  Eq.  2-55,  we  come  to  the  final  equation  as 


0  =  (^m  “  I  ^  pA{x)4>m{x)4>n{x)dx  -h 


-V/  +  - -1  0,n{L)<P^{L) 

i^a  -^n)\ 


From  Eq.  2-57,  we  can  get  the  orthogonality  condition  of  the  system  as 


(2-57) 


/  pA{x)0mix)((>n(x)dx-{- 

Jo 


M  -f  J  2 - 27  ®m(I)d„(I)  =  0,  (m  n)  (2-5Sa) 

-^n)J 

I  pA{x)0l{x)dx .V/-t-A-/a  ,  ,  0l{L)^O,  (m  =  n)  (2-5S) 

0  L  } 


When  m  =  n,  Eq.  2-58  is  equal  to  a  positive  quantity.  We  can  normalize  the 
mode  shapes  by  taking  this  quantity  as  unity.  If  we  consider  the  case  that  the 
cross  section  of  the  beam  is  uniform,  by  taking  the  term  pAL  out  of  the  integral  in 
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Eq.  2-58  and  arranging  the  term  in  the  bracket  with  Eq.  2-27  into  dimensionless 
parauneters,  we  have 

pAL  f  <f>mix)(f>n{x){dx/L) +a{l  +  fia,rxan)<i>miL)<f>„{L)  =  Smn  (2-59) 
Jo 

where  6mn  is  the  Kronecker  delta.  The  mode  shapes  satisfying  Eq.  2-59  are 
referred  to  as  the  normalized  mode  shapes.  The  coefficients  €„  (see  Eq.  2-4Sa) 
of  the  normalized  mode  shapes  can  be  derived  from  Eq.  2-59  as 


Cn  ={  pAL 


I  hl{x){dx/L)  +  Q{l+ pal)hliL) 

Jo 


:2-60) 


The  first  four  normalized  mode  shpaes  for  the  czise  of  /a  =  I  S  are  sho%vn  in 


Fig.  2.5. 
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xlL 

(a) 


x/L 

(b) 


x/L 

(d) 

Figure  2.5  Normaliz  ’  Mode  Shapes  (a)  First  Normalized  Mode  Shape 
(b)  Second  Normalized  Mode  Shape  (c)  Third  Normalized  Mode  Shape 
(d)  Fourth  Normalized  Mode  Shape 
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CHAPTER  3 


Forced  Response  of  the  System 


Forced  Response  of  the  Undamped  System 

In  this  section  we  study  the  response  of  the  undamped  system  subjected  to 
the  sinusoidal  input.  Refering  to  Fig.  2.1,  if  the  external  forces  applied  to  the 
system  axe  not  equal  to  zero,  the  system  response  subjected  to  these  forces  is 
called  the  forced  response.  The  solution  of  the  forced  response  is  assumed  in  the 
form  of  eigenfunction  expansion. 


w{x,t)  =  <f>n{x)qnit) 

n=l 

oo 

Wait)  (3-1) 

n=l 


where  the  eigenfuncitons.  6n{x)  and  an<t>n{L).,  are  the  same  as  the  mode  shapes 
which  have  been  solved  in  Chapter  2  and  the  generalized  coordinates,  gnit),  will 
be  determined  in  this  section. 

We  take  the  approach  of  energy  consideration  to  derive  the  differential  equa¬ 
tion  for  the  generalized  coordinates  qn{t).  Also,  the  orthogonality  derived  in  Sec¬ 
tion  2.3  will  be  used  to  simplify  the  derivation.  Stairting  from  Lagrange’s  equation 
expressed  in  terms  of  the  generalized  coordinates,  we  can  write  the  diferential 
equations  ais  follows 


dt  dqn  dq„  ^  dqn 


Qn(t)  n  =  1, . . . ,  oc 


(3-2) 


21 
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where  T  ajid  V  are  the  total  kinetic  and  potential  energy  of  the  system  respec¬ 
tively,  and  Qn(i)  3se  the  generalized  nonconservative  forces. 

For  the  system  considered,  the  kinetic  energy  of  the  beam  is 


n(t)  =  2 1  M  [— aj— J 


and  the  kinetic  energy  of  the  RMA  is 


(3-3) 


rR(()  =  liV/ I 

Hence,  the  total  kinetic  energy  of  the  system  T(i)  is  equal  to  Ti(i)  -f 


(3-4) 


T(()  =\[  pA  +  jW.WOI’  (3-5) 

Substituting  the  solution  Eq.  3-1  into  the  above  equation  and  taking  the  term 
pAL  out  of  the  summation  notations,  we  have 


OO  OO  *  M  [j 

^  /  <l>n{x)<Pm{x){dx/L) 

"  n=  1  m=  1  .  ® 


a(l  +  pa„arn)<i>„{L)(l>m(L)]  qn{t)qmii)  (3-6) 


where  the  over-bar  denotes  the  dimensionless  form  of  the  mode  shapes. 

Examining  Eq.  3-6,  we  can  see  that  the  te^-m  inside  the  summation  signs 
satisfies  the  orthogonality  derived  in  Section  2.3.  We  know  that  only  in  the  case 
of  n  =  m  the  term  inside  the  bracket  is  equal  to  unity  ,  otherwise,  the  term 
vanishes. 


f  'i>„(x)(i>mix)(dx/L)  +  q{1  +  pa„am)<i>n{L)r>m{L)  =  6mn  (2-59) 

Jo 
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Therefore,  Eq.  3-6  can  be  simplified  as 

rw  =  5  f;  <?"„(() 

“  n=l 

The  potential  energy  of  the  beam  is  given  by  the  following  integral 


(\l 

w{x,  t) 

Jo 

dx^ 

dx 


ajid  the  potential  energy  of  the  RMA  is 


1 


-  Wa{t)f 

Hence,  the  total  potential  energy  of  the  system  V{t)  is  equal  to  Vi,it)  ~  V 


Substituting  the  solution  Eq.  3-1  into  the  above  equation,  we  obtain 

oo  oo  -i 

=  5E  El/  rK(x)CW‘ix+ 

n  =  1  m=  1 

k{l  -  a„)(l  -  am)<Pn{L)dm{L)]  q„it)qm[t) 
With  Eqs.  2-54  and  2-26,  Eq.  3-11  can  be  written  as 


-  OO  OO 

''■(‘J  =  j  E  E 

n=  1  m=  1 

I  ^  dx  f  L)  -}-  0(1  -}-  /iQ„Q,n  )0n  (  E  )(p,7j  (  T  ) 

Jo 


qn[^)qm{^ 

Again,  we  use  the  orthogonality  Eq.  2-59  to  simplify  Eq.  3-12  to 


n=sl 


(3-7) 

(3-8) 

(3-9) 

r(0- 

(3-10) 


(3-11) 


) (3-12) 

(3-13) 
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Substituting  the  total  kinetic  and  potential  energies,  T{t)  and  V  (t)  in  Eqs. 
3-7  and  3-13,  into  the  Lagrange’s  equation  3-2  and  performing  the  differentiations, 
we  can  have  a  set  of  infinite  number  of  independent  differential  equations,  which 
are 

Qnit)  +<^lqn{t)  =  Qrxit)  n  =  l,...,00  (3-1^' 

where  Qn{i)  are  the  nonconservative  forces  of  the  system. 

In  this  section  we  consider  only  the  external  forces,  /(x,  t)  and  E'ftldfx  —  I), 
(no  dampnig  force)  8ls  the  nonconservative  forces  of  the  system  Thp  work  done 
on  the  system  by  the  external  forces  during  a  virtual  displacement  is 

AIF(t)  =  /  f{x,t)Aw{x,t)dx  +  F{t)S{x L)Aiv{x,t) 

Jo 

OO 

=  r 

n=  1 

Also,  the  work  done  on  the  system  by  the  generalized  nonconser-’ative  forces 
is 

OO 

AIF(t)  =  (3-16) 

n=  I 

where  the  subscript  e  denotes  the  external  forces.  By  comparing  Eq.  3-15  to  3-16. 
the  generalized  nonconservative  forces  are  obtained  as 

{Qe)nit)=  f  f{x,t)<t>nix)dx  F{t)4>n{L)  H  =  1,  .  .  .  ,  OC  (3-17) 

Jo 

Taking  the  Laplace  transform  of  Eq.  3-14  with  zero  initial  conditions,  7„(0)  = 
0  and  7n(0)  =  0,  we  can  get  the  solution  of  Eq.  3-14  as 


f(x,t)<p.,{x)dx  +  F(t)4>n(L) 


(3-15) 
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1  r 

qn{t)  =  -  /  {Qt)n{i)sinu„{t  -  T)dT  71=1,. ..,oo  (3-lS) 

JO 

Eqs.  2-48,  3-1  and  3-18  are  the  complete  solutions  of  the  undamped  system  with 
the  external  forces,  /(i,  t)  aJid  F{t)S(x  —  L). 


Forced  Response  of  the  System  with  Viscous 
Damping  in  RMA 

In  this  section  we  will  study  the  damped  response  of  the  system  subjected 
to  the  external  forces.  A  dash-pot  is  introduced  into  the  RMA.  that  is  to  say. 
the  damping  inside  the  RMA  is  not  equal  to  zero  (Fig.  3.1).  The  damping  forces 
mainly  due  to  the  friction  of  the  bearing  are  applied  to  both  masses,  M  and  .V/a , 
with  the  same  magnitude  but  opposite  directions,  which  is  modelled  as 

=cMl,0-u;,(0l  (3-19) 


Figure  3.1  System  with  Dash-Pot  in  RM.A 


To  solve  for  the  damped  response  of  the  system,  we  use  the  same  approach  as  we 
did  in  the  preceding  section.  .Assume  the  solution  is  in  the  form  of  eigenfunction 
expansion; 
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OO 

H^,t)  =  4>n{x)qn{t) 

n=l 

OO 

iWoCO  =  X]«n^n{i)gn(0  (3-1) 

n  =  l 

where  <f>n{x)  and  an4>n{L)  are  the  mode  shapes  and  the  qn{t)  are  the  generalized 
coordinates. 

Based  on  the  approach  of  energy  consideration,  the  Lagrange’s  equation  is  the 
differential  equation,  which  is  expressed  in  terms  of  the  generalized  coordinates 
as 


dt  dqn  dqn  dqn 


Q„(f)  n  =  l . CO 


(3-2) 


Because  the  derivation  of  the  differential  equation  is  the  same  as  the  one  in  the 
preceding  section  except  the  for  generalized  nonconservative  forces,  we  just  rewrite 
the  result  here. 

The  total  kinetic  and  potential  energies  are  listed  below  respectively. 


m  =  i  f;  ii{t)  (3-7) 

“  n=l 

F(()  =  (3-13) 

**  n=l 

The  generalized  nonconservative  forces  subjected  to  the  external  forces  f{x,t) 
and  F{t)6{x  —  L)  are 


X,t)Ortix)dx  +  F(t)0n(L) 


(3-17) 
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The  damping  effect  of  the  dash-pot  in  the  RMA  is  considered  cis  the  non¬ 
conservative  force.  The  work  done  on  the  system  by  the  damping  force  during  a 
virtual  displacement  ,  Aw{L,t)  and  is 


Substituting  Eq.  3-1  into  3-20,  we  have 

OO  OO 

Ak7(t)  =  -  5;]  5]  [c(l  -  a„)(l  -  a^)<i>„{L)<t>„,{L)q,r.{t)]Aqr,it)  (3-21) 

n=  1  m=  1 

The  work  done  on  the  system  by  the  generalized  nonconservative  forces  (Qj)„(t) 
is 

OO 

AW^(t)  =  (3-22) 

n=l 

Comparing  Eq.  3-22  to  3-21  we  can  see  that 

OO 

(Q<i)n(0  =  -  -  an)(l  -  am)<?!>n(T)';^m(I')gm(0  ^  =  1 . CC  (3-23) 

m=  I 

Then  the  total  generalized  nonconservative  forces  are 


[Au;(I,0- Au;a(0]  (3-20) 


AW{t)  =  -c  —(L,t)-w,it) 


Qn{t)  =  {QMt)  +  {Q,Ut) 

L 

f{x,t](i)nit)dx  +  F{t)<t>n{L)- 

OO 

c(l  -a„)(l  -  a^)4>„{L)<pm{L)qr„{t)  n  =  1, . . . ,  co  (3-24) 

m=  1 

Substituting  Eqs.  3-7,  3-13.  and  3-24  into  the  Lagrange's  equation  EQ.  3-2.  we 
have 
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quit) +ulqn{t)  =  Q  nit)  =  Qtnit)  +  Qdn{t)  n  =  l,...,oc  (3-25) 

Because  the  Qdnit)  is  a  function  of  qm  (m  =  1,...  ,oo),  we  can  move  (Qd)n(t)  to 
the  left  of  the  equal  sign  of  Eq.  3-25  to  get 

?n(()  +  Wj)n(()+‘^29„(i)  =  («.)»  n=l,...°o  (3-26) 

Let  us  denote  the  column  matrix  of  the  generalized  coordinates  by  q(t)-  The 
above  equation  can  be  written  as 

Iq(0  +  Cq(<)  +  fiq(0  =  Q*(0  (3-27) 

where  I  is  the  identity  matrix.  And 

q(0  =  ki(0>92(0,---,9n(01^ 

^  ~  [^-mn]  ^mn  —  c(  1  £ln)(l  Qm  )0n(T)0rn  (T) 

Q.(<)  =  ((<3.)i(().(<3.)2(0 . (<3.)n(!)r 

In  Eq.  3-27  we  also  have  a  set  of  infinite  number  of  differential  equations.  Unfor¬ 
tunately,  these  are  coupled  equations.  The  closed-form  solution  of  these  equations 
could  not  be  obtained.  Instead,  the  numerical  software  package  needs  to  be  used 
to  provide  the  numerical  soluton  of  the  equation. 

For  numerical  calculations,  the  system  has  to  be  truncated.  Therefore,  the 
inifinite-degree-of-freedom  system  becomes  2m  N-degree-of-freedom  system.  We 
can  have  more  accurate  smswers  by  including  more  terms  in  the  series. 

After  solving  the  generalized  coordinates,  qn(t),  the  displacements  at  any 
position  along  the  beam  and  the  mass,  Ma,  can  be  calculated  from  Eq.  3-1.  If 
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we  want  to  have  the  differential  equation  of  the  system  in  terms  of  the  physical 
coordinates  instead  of  the  generalised  coordinates.  A  transformation  matri.x 
defined  below  can  do  the  work.  Let  us  define  a  matrix,  G,  as 


where 


an 

ai2 

■  ■ .  ais  ' 

021 

022 

■  a2N 

(3-2S) 

a/vi 

aN7 

...  asN- 

mn  “ 

4>n{x 

II 

and  N  is  the  truncated  size  of  the  infinite-degree-of-freedom  system.  Therefore, 
the  relation  between  the  physical  and  generalized  coordinates  is 


w(t)  =  Gq(t) 


(3-29) 


where 


w(0  =  [wi^  ,t),w{-^L,t), . . .  ,iv{L,t)y 

q(0  =  [9i(0t92(0T---.9iv(0r 


The  differential  equation  in  terms  of  the  physical  coordinates  defined  above  is 

w(f)  -t-  GCG“‘w(t)  +  GnG-'w(f)  =  GQ,{t)  (3-30) 


Eq.  3-30  can  be  reduced  to  the  undamped  system  by  setting  the  damping  c  =  0. 
which  is 

w(f)  +  GfiG-‘w(<)  =  GQe,f)  (3-31) 
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Nondimensionalization  and  Case  Studv 

In  order  to  facilitate  the  numerical  calculations,  the  differential  equation  of 
the  system  is  usually  nondimensionalized  and  expressed  in  the  state-space  form. 
Let  us  define  the  following  dimensionless  parameters: 

^  ^  |F(t)l  ^  \F(t)\ 

pALlj^  pALuii  pAL 


F(t)  = 


\m\ 


(3-32) 


where  u,’6  =  ^/{EI)/{pAL*). 

And  the  dimensionless  mode  shapes  are  defined  cls 


<Pn{x)  =  C„hnix)  <^a  =  a„C„hn{L) 


where 


(3-33) 


,,,,,,  .  ,  COshXnL  +  COsXnL  ,  .  ,  .  .  .  V 

hnix)  =  {coshAnX  —  cosA„i) - - — ; - ; — ; — -{sin/iA„x  —  smA„x) 

stnhXnF  -I-  sinXnL 


^ 

C„  =  =  /  hi 

Jo 


{x)idx/L)  +  a{l+pal)hl{L) 


n  -  {\Lr 

then,  the  transformation  matrix,  G,  is  dimensionless.  For  consistancy  we  denote  it 
as  G.  The  dimensionless  natural  frequencies  are  defined  cis,  u-n/^b  =  ,  n  = 

1. 2. . . . ,  oo.  Hence,  the  dimensionless  matrix,  fl,  is 


n  =  diag[{X„Ly] 

Recall  the  mass  ratio  parameters  defined  before  eu-e 


(3-34] 
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If  the  dcimping  ratio,  (,  is  defined  cls 

C  =  7^  (3-35) 

2iV/au;fc 

the  coefficient  matrix,  C,  can  be  arranged  in  dimensionless  form  as 

C  =  [Cmnj  Cmn  =  2a^<(l  -  am)(l  -  an)'^m(^)'Pn(i)  (3-36) 

And  the  dimensionless  external  force  matrix  Q*  is 


(3-37) 


Q.  =  [(Ojnl 

(Q.)n=  [  f{x,t)o{x)dx  +  F{mL) 

Jo 

With  these  dimensinless  parameters,  the  differential  equaiton  3-30  is  arranged 
in  dimensionless  form  as 

i^{t)  +  GCG-‘^(0  -h  Gnw-^(0w(0  =  GQ<(0  (3-3S) 

Let  us  define  the  state  vector,  x(t),  as 


x(0  =  [u'(  0<  u^(  .  ML,  t),  w{  ^,t),ivi^Lj) . li-iL.t)]^  (3-30) 


The  state-spac'  brm  of  the  differential  equation  in  terms  of  the  physical  coordi¬ 
nates  x{t)  is 

x(<)  =  Ax(<) -f- P(0  (3-40) 


where 


x«)  = 


dx 

It 


A  = 

I 

Gfiw 

0 

GCG"' 

P(0  = 

0 

GQ, 
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Case  Study 

Two  computer  programs  are  written  for  the  numerical  calculations.  The  first 
one  written  in  FORTRAN  computer  code  is  for  czJculating  the  numerical  values  of 
elements  of  the  coeflScient  matrices  of  the  state-space  equation  3-40.  The  second 
one  is  for  MATRIXx  input  which  is  used  to  calculate  the  response  of  the  system 
subjected  the  external  forces.  (See  Appendix.) 

The  following  examples  axe  calculated. 


cast 

I  a 

fa 

C 

1 

0.25 

1.0 

0 

0 

2 

0.25 

1.0 

1.8 

0 

3 

0.25 

1.0 

1.8 

0.5 

The  frequency  and  impulse  response  of  the  displacement  w{L,  t)  is  plotted  in  Figs. 
3.2,  and  3.3. 


1  le 

Figure  3.2  Frequency  Response  of  the  Undamped  System 
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CHAPTER  4 


Active  Vibration  Absorber  Design 

State  Differential  Equation  with  Contorl  Input 

The  mathematical  model  of  the  beam-RMA  system  has  been  derived  in  Chap¬ 
ter  3.  In  this  section  we  analyze  the  effect  of  control  input  to  the  system  and  derive 
the  differential  equation  for  the  system  which  is  called  the  open-loop  control  sys¬ 
tem.  The  derivation  is  bcised  on  the  approach  of  energy  consideration  which  is 
the  same  ais  what  we  did  in  Chapter  3.  The  purpose  of  incorporating  the  RMA 
to  the  beam  is  to  generate  the  control  force  which,  by  its  nature,  applys  to  both 
masses  M  and  Ma  with  the  same  magnitude  but  opposite  directions.  (Fig.  4.1) 

The  RMA  actually  is  a  DC  motor  from  which  force  is  generated  by  electro¬ 
magnetic  effect.  Consider  a  charge  q  moving  with  velocity  V  in  a  magnetic  field  of 
intensity  B  e.xperiencing  a  force  u  given  by  the  vector  cross-product  u  =  qV  x  B. 
Because  the  current  i  is  equal  to  dq/dt,  the  force  u  has  the  magnitude 

u{t)  =  Bli{t)  (4-1) 

if  the  direction  of  current  is  at  right  angle  of  magnetic  field.  Eq.  4-1  is  the  law  of 
motors.  Note  that  the  control  force  is  proportional  to  the  current  input  and  the 
direction  of  the  force  also  depends  on  the  direction  of  the  current. 

Consider  the  model  in  Fig.  4.1,  the  derivation  of  state  differential  equation  is 
the  same  as  the  procedure  has  gone  through  in  Chapter  3  except  that  we  need  to 
add  the  effect  of  the  control  force  which  is  nonconservative.  From  the  approach 
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Figure  4.1  Beam-RMA  System  with  Control  Force  Input 

of  energy  consideration,  the  virtual  work  done  on  the  system  by  the  control  force 
u(t)  during  a  virtual  displacement  Aw{L,t)  and  Awa{t)  is 

AIF(t)  =  u{t)[Aw{L,t)  -  Aiya{t)]  (4-2) 

By  the  assumption  of  eigenfunction  expansion,  the  solution  can  be  expressed  as 

OO  CO 

u;(l,0  =  ^  d„(l)9„(t),  Wait)  = '^an(!>niL)qn{t)  (4-3) 

a=l  n=l 

By  substituting  Eq.  4-3  into  4-2,  Eq.  4-2  can  be  written  as 

OO 

Air(t)  =  ^u(t)(l-a„)d„(i:)Ag„(t)  (4-4) 

a=l 

Also,  the  work  done  on  the  system  by  the  generalized  nonconser\’ative  forces 
Qcn{t)  is 

OO 

AkF(t)  =  Zo  cn(0Ag„(0  (4-5) 

n=l 

Comparing  Eq.  4-4  to  4-5,  we  obtain  the  generalized  nonconser\'ative  forces  due 
to  the  control  input  u(t)  as 
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Qcn{t)  =  {I  -  ar,)(pniL)u{t)  n  =  l,...,co  (4-6) 

Arran^ng  Eq.  4-6  into  the  matrix  form  and  adding  it  to  the  system  equation 
3-27  in  Chapter  3,  we  have 

Iq(/)  +  Cq(f)  +  Qq{t)  =  Q,(0  Q,(t)  (4-7) 

where 

Because  the  generalized  coordinates  q  (t)  cam  not  be  measured  for  feedback, 
we  need  to  transform  Eq.  4-7  into  am  equation  in  terms  of  the  physical  coordinates 
x(t)  which  cam  be  sensored  by  the  accelerometers.  This  has  already  been  done  in 
Chapter  3.  We  just  quote  the  result  amd  apply  it  to  Eq.  4-7  to  have 

w(t)  +  GCG-^w(0  +  GQG-‘w(0  =  GQ,(0  -f  GQc(t)  (4-S) 

Agaiii,  through  the  same  procedure  to  nondimensionlize  Eq.  4-S,  we  can  have 

^{t)  -t-  GCG-‘^(t)  +  GaG-‘w(t)  =  GQJt)  +  GQc(t)  (4-’j) 

where  the  elements  of  Qc(i)  are 

The  state-space  form  of  Eq.  4-9  is 

x(0  =  Ax(t) -i- Bu(t) -i- P(f)  (4-10) 
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where  xU),  A,  axid  P(f)  axe  described  in  Section  3.3,  and 


B 

Q'c 


0 

GQo 


u(0  = 


u{t) 


\m\ 

[(1  ~  Q]  (1  —  a2)<i>2{L),  . 


,.,(1  -  a„)<^„(I)]^ 


Eq.  4-10  is  a  linear  time-invariant  system  because  the  coefficient  matrices  A  and 
B  axe  constaxit.  We  will  use  it  as  the  mathematical  model  to  derive  the  feedback 
control  law. 


Stabihtv  and  Controllability 

In  this  section  we  are  interested  in  the  overall  time  behavior  of  the  differential 
system  to  see  whether  or  not  the  solutions  of  the  state  differential  equation  tend 
to  grow  infinitely  as  f  — »  oc.  This  is  called  th^  rtability  of  the  system. 

For  solving  the  control  problem,  it  is  also  important  to  know  whether  or  not 
the  system  has  the  property  that  it  may  be  steered  from  any  given  state  to  any 
other  given  state.  This  leads  to  the  concept  of  controllability  of  the  system. 

Before  starting  to  design  the  control  system,  we  should  discuss  both  stabil¬ 
ity  and  controllability  to  see  how  stable  the  system  is  anu  make  sure  that  u  is 
controllable. 

The  state  differential  equation  for  the  system  with  control  force  input  has 
been  derived  in  preceding  section  which  is 

x(fj  =  Ax(<)  +  Bu(0  +  P(f)  (4-11) 

which  is  a  linear  and  time-invariant  system  with  dimension  2N.  From  control 
theory  we  know  that  the  stability  of  the  system  4-11  depends  on  the  coefficient 
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matrix  A.  The  system  4-11  is  said  to  be  exponentially  stable  if  and  only  if  all  the 
eigenvadues  of  the  matrix  A  have  strictly  negative  real  parts.  That  is  to  say,  all 
the  poles  of  the  open-loop  control  system  have  to  be  within  the  left-hand  side  of 
the  complex  plane.  If  this  condition  is  valid  for  the  system,  we  can  be  sure  that 
the  solutions  of  Eq.  4-11  with  finite  input  P{t)  will  not  grow  with  bound. 

Setting  u(t)  and  P(t)  equal  to  zero  in  Eq.  4-11,  and  taking  the  Laplace 
transform  of  the  Eq.  4-11,  we  have 

sX  =  AX 

(sI-A]X  =  0  (4-12) 

In  order  to  have  nontrivial  solutin  for  Eq.  4-12,  the  determinant  of  matrix  si  —  A 
must  be  equal  to  zero 

|sI-A|  =  0  (4-13) 


The  eigenvalues  of  the  matrix  A  can  be  determined  by  solving  the  Eq.  4-13. 
Expanding  the  determinant  gives  a  2iV‘^-order  polynomieil  of  variable  s.  The 
roots  of  the  polynomial  are  the  eigenvalues  of  the  open-loop  control  system  which 
are  denotes  by  Ai ,  A2, . . . ,  X2N- 

From  control  theory,  the  controllability  depends  on  the  coefficient  matrices 
A  and  B.  The  system  4-11  is  said  to  be  completely  controllable  if  and  only  if  the 
rank  of  the  controllability  matrix  H 


H  =  [B,AB,a2B,...,a2'^-'B] 


(4-14) 
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is  2N  which  meains  that  the  determinant  of  matrix  H  is  not  equal  to  zero.  If  this 
condition  is  valid,  a  control  system  for  steering  from  a  given  state  to  any  other 
given  state  is  possible  to  find. 

Case  Study 

Throughout  Chapters  4  and  5,  the  discussion  and  design  of  the  feedback 
control  loop  are  based  on  the  system  model  including  the  first  two  vibration 
modes.  Then  the  state  differential  equation  is  a  4-dimensional  system.  The  state 
vector  chosen  is 


x(<)  = 


’w{.5L,t)' 
w{L,t) 
W{.5Ly  t) 


The  numerical  values  for  the  system  pareuneters  are  assigned  as 


(4-15) 


a  =  0.25  n  =  1.0  fa  =  1.8  C  =  0-5 

1F(0I 


From  the  output  of  the  FORTRAN  computer  program  (listed  in  .Appendix),  we 
have  the  coefficient  matrices 


'.1753 

.4341' 

Q  = 

'2.270 

0 

■  .0835 

-.1346' 

.5525 

1.310 

0 

8.739 

= 

-.1346 

.2168 

.5525 

1.310 


-1.292' 

2.082 


(4-17) 


Stability 

The  state  differential  equation  of  the  open-loop  control  system  without  damp¬ 
ing  and  external  forces  is  calculated  form  Eqs.  4.9  and  4.10  a** 
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x(<)  = 


r  ° 

0 

1 

0l 

r  0  1 

0 

0 

0 

1 

x(0± 

0 

-154.42 

48.275 

0 

0 

.677 

L -459.15 

143.41 

0 

oJ 

L  2.014  J 

(4-18) 


The  stability  of  Eq.  4-18  depends  on  the  eigenvalues  of  matrix  A  which  are 
calculated  as 


±1.507i,  ±2.956:’ 


These  four  poles  are  located  on  the  imaginary  axis  and  the  system  is  neutrally 
stable  which  means  that  the  response  neither  dies  out  nor  grow  to  infinity.  This 
is  what  we  expected  because  there  is  no  damping  in  the  system. 

The  state  differential  equation  of  the  open-loop  system  with  damping  is 


x{t)  = 


r  ^ 

0 

1 

0  1 

r  0  -1 

0 

0 

0 

1 

x(f)± 

0 

-154.42 

48.275 

-47.235 

15.38 

.677 

1-459.15 

143.41 

-140.42 

54.73  J 

L  2.014  J 

(4-19) 


The  poles  of  the  open-loop  system  are 


-.1083  ±2.950:’,  -.0418  ±  1.5082:  (4-19a) 

The  damped  frequencies  and  damping  ratios  for  the  first  two  modes  are 

(wd)i  =  2.952,  (wrf)2  =  1.509 
Cl  =  .0367,  C2  =  .0277 

The  poles  of  the  damped  system  are  within  the  left-hand  plane  and  it  is 
exponentially  stable.  We  see  that  the  damping  ratios  for  the  two  modes  are  very 
low.  This  explans  the  resonant  phenomenon  of  the  frequency  response. 


37^3 


Controllability 


The  cc  *itrollability  matrix  of  the  undamped  system  is 

■  0  .667  0  -7.317  ' 

0  2.014  0  -22.014 

.677  0  -7.317  0 

.2.014  0  -22.014  0 

which  can  be  easily  shown  that  it  has  the  rank  4  and  the  system  is  completely 
controllable. 

The  controllability  matrix  of  the  damped  system  is 


■  0 

.667 

-1.003 

—5.539  ■ 

0 

2.014 

-2.964 

-16.756 

.677 

-1.003 

-5.539 

15.658 

.2.014 

-2.964 

-16.756 

46.799  . 

of  which  the  rank  4  may  be  shown  to  be  4  and  the  system  is  completely  control¬ 
lable. 


Performance  Improvement  bv  State 
Feedback  Control 

From  preceding  section,  we  know  that  the  open-loop  control  system  is  slightly 
damped  because  the  poles  of  the  system  are  close  to  the  imaginary  axis.  This 
explains  why  the  resonant  phenomenon  occurs  when  the  system  is  subjected  to 
a  sinusoidal  input.  In  this  section  we  use  state  feedback  control  as  the  tool  to 
suppress  the  resonant  vibration  of  the  system  by  improving  its  performance,  which 
also  means  that  we  can  push  the  poles  far  to  the  left  of  the  complex  plane  by 
feedback  control.  This  is  called  the  pole- placement  technique. 
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The  closed-loop  system  is  made  up  of  sensors,  signal  amplifiers,  and  the  RMA 
(Fig.  4.2).  The  accelerations  are  measured  by  the  accelerometers  and  integrated 
twice  to  yield  the  velocities  and  displacements  for  feedback.  The  amplifiers  are 
modelled  as  a  set  of  constant  gains.  The  RMA  is  the  actuator  of  the  control 
system  which  generates  the  control  force  proportional  to  the  current  input. 

From  the  case  study  of  preceding  section,  we  saw  that  our  system  is  com¬ 
pletely  controllable  which  guarantees  that  a  control  law  can  be  found.  By  adjust¬ 
ing  the  gain  values,  the  eigenvalues  of  the  closed-loop  system  can  be  changed.  The 
main  effort  in  this  section  is  to  determine  the  gain  matrix  to  have  the  preferable 
frequency  response. 


Figure  4.2  State  Feedback  Control  System 


The  state  differential  equation  has  been  derived  in  the  first  section  of  this 
chapter  which  is  a  linear  and  time- invariant  system. 

k(f)  =  Ax(f) -H  Bu(0  +  P(0  (4-20) 

Suppose  that  the  complete  state  can  be  accurately  mesisured  at  all  times,  the 
conrol  law  implemented  in  Fig.  4.2  is  linear  which  has  the  form 
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u{i)  =  —Kx(t) 


where  K  is  the  feedback  gain  matrix. 


(4-21) 


Figure  4.3  Block  Diagram  of  the  Feedback  Control  System 

If  this  control  law  is  connected  to  the  system  4-20,  the  closed-loop  system 
(Fig.  4.3)  is  described  by  the  following  state  differential  equation 

x(t)  =  (A-BK)x(0  +  P(t)  (4-22) 

The  dynamics  of  the  closed-loop  system  depend  on  the  eigenvalues,  which  is 
referred  as  the  closed-loop  poles  of  the  system.  The  poles  are  determined  from 
the  matrix  A  —  BK.  By  choosing  suitable  constajit  gain  matrix  K,  we  can  move 
the  closed-loop  poles  to  left  in  the  complex  plane  to  have  required  system  response. 
Because  our  system  has  single-input  control  force  u,  the  gain  matri.x  is  uniquely 
found  for  a  given  set  of  closed-loop  poles. 

From  the  eigenvalue  problem  analysis,  we  set  the  external  forces  P(f)  =  0  in 
Eq.  4-22  and  take  the  Laplace  transform  of  this  equation  to  have 

sX  =(A  -  BK)X 
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[^I-(A-BK)]X  =  0  (4-23) 

The  determinant  of  the  the  matrix  5I  —  (A  —  BA')  must  be  equal  to  zero  to  have 
the  nontrivial  solution. 

Isl  -  (A  -  BK)1  =  0  (4-24) 

If  the  system  is  truncated  to  have  N  modes,  then  the  chairacteristic  polynomial 
obtained  by  expanding  the  determinant  of  Eq.  4-24  is  a  2jV‘* -order  polynomial. 
It  has  2N  poles  which  can  be  located  at  any  positions  by  eissinging  siutable  gains 
in  the  K  matri.x.  Let  the  closed- loop  poles  are  Ai ,  A2, . . . ,  A2N-  The  corresponding 
polynomial  in  factor  form  is 

(s-Ai)(5-A2)...(s-A2n)  (4-25) 

Comparing  the  coefficients  of  both  polynomials  obtained  from  Eqs.  4-24  and  4-25, 
we  have  2N  equations  from  which  the  2N  unknown  gains,  Lj ,  ^'2, . . . ,  t2^v  can  be 
obtained.  Following  is  a  study  of  feedback  control  system. 

Case  Study 

We  continue  the  study  in  Section  4.2  to  illustrate  the  performance  improve¬ 
ment  by  the  state  feedback  control  system.  To  suppress  the  resonance  of  the  first 
two  modes  of  the  combined  system,  a  two-position  feedback  loop  is  a  good  model 
to  work  with  (Fig.  4.4).  We  measure  the  states  at  positions  i  =  0.5Z  and  x  =  L 
for  feedback.  The  state  differential  equation  of  the  open-loop  system  is  given  by 
Eq.  4-19 
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Figure  4.4  Two-Position  Feedback  Control  System 


r  0 

0 

1 

0  1 

r  0  -1 

0 

0 

0 

1 

x(f)  + 

0 

-154.42 

48.275 

-47.235 

15.38 

.677 

1-459.15 

143.41 

-140.42 

45.73  J 

L  2.014  J 

x(t)  = 


The  poles  of  the  open-loop  system  are 

-.1083  ±  2.950i,  -.0418  ±  1.5082i 

Let  us  consider  the  control  law 


u{t)  (4-19) 


u{t)  =  -[kuk2,k3,k4]x{t) 


(4-19a) 


(3-26) 


Suppose  we  want  the  closed-loop  poles,  Ai,  A2,  A3,  A4,  to  be  located  at  — 1.5  i  .21 
eind  —.5  ±  .11,  then  the  control  law  is  calculated  eis 


u(f)  =  -(-219.8, 71.22, -33.84. 12.62)x(t)  (4-27) 

Substituting  Eq.  4-27  into  Eq.  4-19  and  adding  a  concentrated  force  F{t)6{x  - 1), 
we  have 


x(0  = 


r  ° 

0 

1 

0  1 

r  0  1 

0 

0 

0 

1 

x(0  + 

0 

-6028 

1963 

-5133 

1718 

.665 

L -17933 

5840 

-15270 

5II0J 

L2.O21J 

Fit) 


(4-2S) 
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For  sinusoidal  input,  F{t)  =  For  impulse  input  F{t)  =  ^(t).  The  frequency 
response  and  impulse  response  of  the  displacement  w{L,  t)  of  the  system  without 
and  with  state  feedback  control  system  are  shown  in  Figs.  4.5  and  4.6,  respec¬ 
tively. 


1 


.5 


e 

8  1  2  3  4  S 

Figure  4.5  Frequency  Response  of  the  System  without  and  with 
State  Feedback  Control  System 
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Figure  4.6  Impulse  Response  of  the  System  without  and  with 
State  Feedback  Control  System 
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Design 


47 


In  preceding  section,  we  saw  that  the  performance  of  the  system  can  be 
improved  by  state  feedback  control  system.  By  placing  the  closed-loop  poles 
to  the  left  of  the  complex  plane,  the  frequency  response  of  the  system  can  be 
improved  to  any  degree  that  we  want.  However,  this  is  under  the  assumption 
that  control  force  is  unlimited.  In  any  practical  problem  the  control  force  must 
be  bounded.  This  imposes  a  limit  on  the  distance  over  which  the  closed-loop 
poles  can  be  moved  to  the  left.  This  consideration  leads  to  the  formulation  of  an 
optimization  problem  which  takes  both  the  effect  of  vibration  reduction  and  the 
magnitude  of  the  control  force  into  the  design  consideration. 

The  closed-loop  sytem  derived  in  the  preceding  section  ccin  be  expressed  in 
state-space  form 

x(0  =  Ax(0  +  Bu(t)-bP(0  (4-10) 

which  is  a  linear  time- invariant  state  equation  with  linear  feedback  control  law 

u{t)  =  -Kx(t)  (4-29) 

The  design  goal  of  optimal  control  system  is  to  find  a  control  law,  which  under  the 
limitation  of  the  control  effect,  Ccin  reduce  the  vibration  of  the  beam  as  much  as 
possible.  A  performance  index  hzis  to  be  defined  to  have  a  mathematical  statement 
of  the  design  goal.  Based  on  the  performaince  index,  the  optimal  design  is  found. 

The  design  goal  can  be  divided  into  two  peirts.  One  is  to  minimize  the 
vibration,  the  other  is  to  reduce  the  control  effect.  Usually  a  quadratic  integral 
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form  is  employed  ais  the  index.  The  index  for  vibration  of  the  beam  is  chosen  as 
following 

x^(f)RrxX(t)<if  (4-30) 

where  x(t)  is  the  physical  coordinates  which  are  the  particular  positions  for  where 
the  vibration  need  to  be  suppressed  and  the  Rn  is  a  nonnegative-definite  sym¬ 
metric  weighting  matrix. 

The  index  for  control  effect  is 

/  u{t)R^^u{t)dt  (4-31) 

Jo 

where  the  R^u  is  a  positive-definite  symmetric  weighting  matrix. 

Hence,  the  performance  index  taking  both  effects  into  consideration  is 

/•OO 

J  =  /  [x^{t)RiiX{t)  +  u{t)RuuHt)]dt  (3-32) 

Jo 

If  we  can  find  a  control  force  u’(f)  such  that  the  performance  index  J  is  minimized, 
we  call 

u*(0  =  -K*x(t)  (4-33) 

the  optimal  feedback  control  law  with  the  optiaml  feedback  gain  matrix  K*.  The 

system  4-10  with  Eq.  4-33  is  called  an  optimal  linear  quadratic  regulator  (LQR). 
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Case  Study 

We  continue  the  study  in  Section  4.3  with  the  state  differential  equation 


u(f)  (4-19) 


0 

0 

1 

0  - 

■  0  - 

0 

0 

0 

1 

x(0  + 

0 

-154.42 

48.275 

-47.24 

15.38 

.677 

.-459.15 

143.41 

-140.42 

45.73. 

.2.014. 

x(f)  = 

If  the  weighting  matrices  Rn  and  Ru„  are  chosen  as 

Ru„  =  [1  *  10~^| 


Rzx  = 


ro 

0 

0 

Lo 


0 

1 

0 

0 


0  0l 
0  0 
0  0 
0  OJ 


(4-34) 


which  means  that  the  design  goal  is  to  reduce  the  vibration  of  the  beam  at  position 
oi  X  =  L  and  the  number  chosen  for  Ruu  depends  on  the  control  force  limitation. 


The  optimal  feedback  control  law  calculated  is 


u*(t)  =  -[-51.53, 48.82, 425.3, -138.2]x(f)  (4-35) 


Substituting  Eq.  4-35  into  Eq.  4-10,  we  have  the  optimal  closed-loop  system  as 


x(t)  = 


r  0 

0 

1 

0  1 

r  0  -] 

0 

0 

0 

1 

x(0  + 

0 

-120 

15.2 

-335 

109 

.665 

L-335 

45.1 

-997 

324  J 

L  2.021  J 

F[i) 


(4-36) 


The  frequency  response  and  impulse  response  of  the  displacement  ui{L.t)  of  the 
system  with  and  without  optimal  feedback  control  are  shown  in  Figs.  3.7  and  3.S, 
respectively. 
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a  1  2  3  4  5 

Uj/uJi 


Figure  4.7  Frequency  Response  of  the  System  without  and  with 
Optimal  Control  System 
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Figure  4.S  Impulse  Response  of  the  System  without  and  with 
Optimal  Control  System 
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Sensitivity  “ 

As  we  saw  in  the  preceding  study,  the  designed  feedback  control  system  • 

is  based  on  a  truncated- mode  model  which  has  a  discrepancy  between  the  real 
system.  This  can  be  viewed  as  a  kind  of  parameter  changes  in  the  modeling 
precedure.  Besides,  the  assumptions  of  modeling  prodecure  also  bring  this  kind  • 

effects  to  the  system  equation.  In  this  section  study  whether  or  not  the  optimal 
control  system  subjected  to  the  parameter  changes  still  works. 

A  very  important  property  of  the  feedback  con'rol  system  is  its  ability  to 
suppress  noise  and  to  compensate  for  pau-ameter  changes.  We  will  investigate  to 
what  extent  the  system  with  optimal  LQR  design  possesses  this  property.  We 
call  this  property  the  sensitivity  of  the  system.  Our  study  is  limited  to  the  effects 
of  parameter  changes  and  consider  only  the  steady-state  case  where  the  terminal 
time  is  infinite. 

The  optimal  feedback  control  system  is 

k(f)  =  Ax(0  +  Bzi(f)  +  P(f)  (4-10) 

with  the  optimal  control  law 

u*(f)  =  -K*x(f)  (4-33) 

From  control  theory  (See  .Appendix  7,  Chapter  3),  the  sensitivity  of  the  closed- 
loop  system  for  compensating  the  parameter  changes  as  compared  to  an  equivalent 
opne-loop  configuration  is  determinied  by  the  return  difference  matrix  J(sj 

J(s)  =  I  +  (sI- AW'BK*  (4-37) 
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If  the  condition 

J^{-juj)WJ(juj)  >  W  (4-38a) 

for  every  real  input  frequency  u  is  satisfied,  where  the  matrix  W  is  the  weighting 
matrix  of  the  sensitivity  criterion  4-38, 

W  =  K'^R„,.K'  (4-38b) 

we  can  guarantee  that  the  optimal  control  system  design  has  the  ability  to  com¬ 
pensate  the  parameter  changes. 

Because  the  matrix  K*  Ru^K*  is  known  after  the  control  law  has  been 
computed,  it  is  difficult  to  choose  the  desigii  parameters  R^^  and  R„u  such  that 
the  condition  4-38  can  be  achieved.  However,  under  the  condition  that  the  poles 
of  the  equivalent  open-loop  controlled  system  are  all  within  the  left-hand  plane, 
the  condition  4-38  can  be  guaranteed. 

The  equivalent  statement  of  condition  4-38  is  the  same  as  that  if  the  condition 

W  R„  (4-39) 

as  Ruu  — >  0  is  satisfied,  the  sensitivity  of  the  system  can  be  guarnateed. 

Case  Study 

From  the  Ccise  study  in  Section  4.4,  if  the  weighting  matrices  Rrr  and  Ruu 
are 

-0  0  0  0- 

r*  0  10  0  „  f, 

^““  0  0  0  0  -  [1  X  10 

.0  0  0  0. 
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the  optimal  feedback  gmn  matrix  K*  calculated  from  MATRIXx  is 

K*  =  [-224.,  3232, 452.6, -96.3] 

The  weighting  matrix  W  in  Eq.  4-38  is 


■  .0005 

-.0228 

-.0004 

-.0001- 

w  = 

-.02280 

1.0142 

.0193 

.0035 

-.0004 

.0193 

.0004 

.0001 

.  -.0001 

.0035 

.0001 

.0000  . 

The  matrix  W 

is  quite  close  to  the  limiting  matrix  Rn- 

This  means  that  the 

design  of  control  system  based  on  the  truncated  model  guzircintees  that  it  has 
enough  ability  to  compensate  the  parameter  changes  and  to  suppress  the  noise. 
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CHAPTER  5 


Estimator  Design  and  Output  Feedback 
Control  System 

Reconstructabilitv 


Because  the  linear  optimal  control  law  requires  the  complete  state  for  feed¬ 
back,  we  have  to  reconstruct  the  complete  state  when  the  state  measurement  is 
incomplete  and  inaccurate.  In  this  section  we  study  whether  or  not  the  com¬ 
plete  state  can  be  reconstructed  from  the  measured  state  (output).  We  call  this 
ability  the  reconstructability  of  the  system.  Before  designing  an  estimator,  we 
have  to  discuss  the  reconstructability  to  make  sure  that  such  an  estimator  can  be 
desinged. 

The  state  differential  equation  for  the  system  with  output  equation  is  given 
by  the  following 


x(t)  =  Ax(t)  +  Bu(f ) 

y(t)  =  C'x(f)  (5-1) 

which  is  a  linear  and  time-invariant  system  with  dimension  2N.  From  control 
theory  we  know  that  the  reconstructability  of  the  system  5-1  depends  on  the 
coefficient  matrices  A  and  C'.  The  system  5-1  is  said  to  be  completely  rccon- 
structable  if  and  only  if  the  rank  of  the  recontruct ability  matri.x  H', 
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H'  = 


C 

CA 

CA^ 


Lca^^-' 


(5-2) 


is  2N  which  means  that  the  determinant  of  matrix  H'  is  not  equal  to  zero.  If 
this  condition  is  valid,  an  estimator  for  reconstructing  the  complete  state  from  an 
imcompletelly  measured  output  is  possible  to  design. 

Case  Study 

We  continue  the  case  studied  in  Chapter  4.  The  open-loop  control  system  is 
described  by  the  differential  equation 


x{t)  = 


r  ° 

0 

1 

0  1 

r  0  -1 

0 

0 

0 

1 

x{t)  -f 

0 

-154.42 

48.275 

-47.235 

15.38 

.677 

L -459.15 

143.41 

-140.42 

45.73  J 

L  2.014  J 

u{t)  (5-3) 


If  the  output  state,  that  we  want  to  measure,  is  the  displacement  u’(I.f).  then 
the  output  coefficient  matrix  C'  is 


C'  =  (0,1,0,01 
w{L,t)  =  [0,  l,0,0|x(f) 


(5-4) 

(5-5) 


The  rescontructability  of  the  system  5-1  depends  on  the  matrix  H'.  calculated  ns 

0  1  0  0- 

^  0  0  0  1 

“  “  -459.153  143.412  -140.42  45.73 

.  686.73  -220.64  -247.82  74.98. 

which  has  the  rank  4.  This  means  that  the  state  of  system  5-1  is  completely 
constructable.  Therefore,  the  estimator  design  will  be  possible. 
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Full- Order  Estimator  Design 

When  the  complete  state  is  not  available  for  feedback  or  there  is  a  lot  of  noise 
in  measurement,  we  need  to  reconstruct  the  complete  state  from  the  paiticilly 
measured  state.  As  we  czin  see  from  Chapter  4,  the  more  vibration  modes  are 
included  in  the  model,  the  more  acceleramtors  and  integrators  we  will  need  for 
the  feedback  loop.  With  this  in  mind,  we  can  imagine  that  the  feedback  loop  will 
become  very  complicated.  This  situation  usually  is  not  feasible.  An  estimator  will 
be  the  answer  to  this  problem.  Our  study  here  is  focused  on  full-order  estimator 
design  which  means  the  estimator  can  generate  the  complete  state  for  feedback. 


Figure  5.1  Block  diagram  of  a  full-order  estimator 

The  state  differential  equation  of  the  system  is 

x(t)  =  Ax(t)  +  Bu(t)  (5-1) 

If  the  output  of  the  system  is  w(L,  t),  we  can  find  a  coefficient  matrix  C  which  is 
C'  =  [0  0  ...  1  0  0  ...  0],x2.v  (5-6) 
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such  that 

y(0  =  t)  =  C'x{t)  (5-7) 

The  design  goad  is  to  find  an  estimator  taking  the  w{L,  t)  and  u{t)  as  input 
which  can  genarate  output  state  x(f)  equal  to  the  real  system  state  x{t)  after 
Tuning  the  system  a  finite  period  of  time.  Based  on  this,  we  can  set  a  state 
differential  equation  for  the  estimator  as 

x(t)  =  Fx(f )  -1-  Gew{L,  t)  +  Hu(<)  (5-S) 

By  substracting  Eq.  5-1  from  5-S  and  combining  with  Eq.  5-7.  we  have  the 
following  differential  equation  for  x(f)  —  x{t), 

x{t)  -  i(t)  =  [A  -  GeC']x(t)  -  Fx(0  -f  [B  -  H]u(0  (5-9) 

Because  x{t)  —  x(t)  after  a  period  of  time,  from  Eq.  5-9  we  can  obtain 

F  =  A  -G^C' 

H  =  B  (5-10) 

Therefore.  Eq.  5-9  can  be  written  as 

x(t)  -  x{t)  =  [A  -  G,C'][x(t)  -  x(0]  (5-11) 

which  is  the  differential  equation  of  the  reconstruction  error  of  the  estimator. 
From  Eq.  5-11  we  can  easily  see  that  the  performance  of  the  estimator  depends 
on  the  matrix  [A  —  G^C'].  Then  the  state  differential  equation  of  the  estimator 
from  Fig.  5-1  can  be  written  as 

x(t)  =  Ax{t)  -f  Ge[u;(I,t)  -  C'x(t)]  +  Bu(t) 

=  [A  —  GeC']x(t)  +  Gew{L,  t)  +  Bu(t)  (5-12a) 
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And  the  estimated  output  of  the  system  is  w{L,t) 

w{L,t)  —  C'x{t)  {5-12b) 

The  performance  of  the  estimator  depends  on  the  eigenvalues  of  the  matrix 
[A  — GgC']  in  Eq.  5-12.  So  fax,  choosing  matrix  Gj  is  still  arbitrary.  By  using  the 
pole  placement  technique,  we  can  have  the  requried  performance  for  the  estimator. 
The  determination  of  the  estimator  gain  matrix  Ge  is  the  maiin  task  of  the  design. 
Because  the  design  procedure  is  the  same  as  the  one  in  Section  4.3,  we  will  not 
repeat  it  here. 

Case  Study 

We  continue  the  case  study  with  open-loop  control  system  (refering  to  Eq. 
4-19).  Let  the  control  force  u(t)  is  equal  to  zero  in  this  case. 


If  the  poles  of  matrix  [A  -  G^C']  are  placed  at  -3.  ±  0.03i  and  -2.  ±  0.02?. 
The  estimator  gain  matrix  calculated  is 


■  2.748  ■ 
^  8.495 

*  ~  4.516 

.13.603. 


(5-14) 
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The  state  differential  equation  for  the  estimator  is 


r  ° 

-2.748 

1 

0  ^ 

r  2.748  -] 

0 

-8.495 

0 

1 

8.495 

-154.42 

43.758 

-47.235 

15.38 

x(0  -1- 

4.516 

L -459.15 

129.81 

-140.42 

45.73  J 

L  13.603  J 

(5-15) 


And  the  reconstruct  output 


w{L,t)  =  [Q  1  0  Olx(t)  (5-16) 

The  initial-condition  response  of  the  real  output  and  estimated  output  is  shown 
in  Fig.  5.2. 


Figure  5.2  Initial-condition  response  of  the  real  output  and  estimated  output 
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Output  Feedback  Control  Svstem 

In  this  section  we  connect  the  optimal  LQR  with  estimator  to  have  the  output 
feedback  control  system.  This  system  takes  the  measured  output  y{t)  for  feedback. 
The  overall  performance  of  the  output  feedback  control  system  depends  on  both 
regulator  ajid  estimator.  From  contol  theory  we  know  that  the  eigenvalues  of 
the  complete  system  consists  of  the  eigenvalues  of  regulator  and  the  eigenvalues 
of  estimator  separately.  This  means  that  we  can  design  regulator  and  estimator 
independantly  and  then  put  them  together. 

The  state  differential  equation  with  optimad  LQR  is 

x{t)  =  Ax(t)  +  Bu*{t)  +  P(t)  (5-17) 

and  the  measured  output  is  given  oy  the  following  equation 

tZ;(I,f)  =  C'x(0  (5-lS) 

The  optimal  control  law  calculated  in  Section  4.4  with  estimated  state  is 

u*(t)  =  -K’x{t)  (5-19) 

Also,  the  estimator  designed  in  preceding  section  satisfies  the  following  state 
differential  equation 

i(f)  =  [A  -  G*C']x(0  +  Bu-(f)  -K  GMLJ)  (5-20) 

where  G*  is  the  estimator  gain  matrix.  Fig.  5.3  shows  the  interconnection  of  the 
beam-RM.A.  system,  the  optimal  control  law,  and  estimator. 
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I  output  Feedback  control  systen 


Figure  5.3  The  output  feedback  control  system 

Combine  state  differential  equations  5-17  and  5-20  with  Eqs.  5- IS  and  5- 
19  into  one  equation.  Then  the  complete  system  connected  with  optimad  LQG 
regulator  and  estimator  is  described  by  the  state  differential  equation 


’x(0 

_x(t) 

A 

G,C' 


-BK* 

A  -  G«C'  -  BK* 


x{t) 

i(0j 


-I- 


P(0 

0 


(5-21) 


The  performance  of  the  complete  system  depends  on  the  eigenvales  of  Eq. 
5-21.  Based  on  the  linear  optimal  contiol  theory  we  know  that  the  poles  of  the 
system  5-21  consists  of  the  optimal  LQR  poles  and  the  estimator  poles.  Therefore, 
if  both  optimal  LQR  and  es  imator  are  well  designed,  the  response  of  the  complete 
system  subjected  to  the  external  forces  should  be  satisfactory. 
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CcLse  Study 

From  the  results  of  case  study  in  Chapter  4  and  preceding  section,  the  state 
differential  equation  of  complete  system  can  be  calculated  as 


0 

0 

1 

0 

0 

0 

0 

0 

0 

0 

0 

1 

0 

0 

0 

0 

—  154.4 

4S.2S 

-47.24 

15.38 

34.88 

-33.05 

-287.9 

93.53 

-459.2 

143.4 

-140.4 

45.73 

103.8 

-98.32 

—856.5 

278.2 

0 

2.75 

0 

0 

0 

-2.75 

1 

0 

0 

8.5 

0 

0 

0 

-S.5 

0 

1 

0 

4.52 

0 

0 

-119.5 

10.7 

-335.1 

108.9 

0 

13.6 

0 

0 

—355.4 

31.5 

-996.9 

324 

The  frequency  response  and  impulse  response  of  the  system  without  and  with 
output  feedback  control  system  are  shown  in  Figs.  5.4  and  5.5,  resp«tctively. 
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without  output  feedback  control - 


with  output  feedback  control 


Figure  5.4  Frequency  Response  of  the  System  without  and  with 
Output  Feedback  Control  System 


e  3  6  9  12  IS 

t{sec.) 


Figure  5.5  Impulse  Response  of  the  System  without  and  with 
Output  Feedback  Control  System 
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Conclusions 


Modal  analysis  approach  has  been  successfully  applied  to  the  cantilever 
beam-RMA  system  to  solve  the  boundary- vlaue  problem.  This  approach  is  more 
straight-forwzird  in  understanding  and  application  than  the  approach  of  Green’s 
functions  in  [2].  Although,  the  Euler- Bernoulli’s  beam  equation  is  used,  the  model 
derived  for  the  system  can  be  modified  to  have  damping  effect  in  the  beam. 

Incorporati*i  of  RM.\  to  the  cantilever  beam  introduces  an  additional  vibra¬ 
tion  mode  to  the  system.  By  adjusting  the  tuning  and  damping  of  RMA,  the 
system  response  can  be  changed.  RMA  used  in  this  way  is  called  a  passsive  vi¬ 
bration  absorber  (PVA).  As  we  saw  in  Figs.  3.2  and  3.3  the  first  mode  is  splited 
into  two  modes  with  little  effect  on  the  other  modes  when  the  RMA  is  tuned 
to  the  first  mode.  The  same  phenomenon  happens  for  the  case  of  tuning  to  the 
second  mode.  Increasing  damping  in  RMA  can  bring  the  infinite  peak  response 
of  resonance  down  to  lower  level. 

If  the  system  is  lightly  damped,  analytic  solution  of  the  undamped  system 
will  be  a  good  approximation.  Analytic  solution  of  the  damped  system  is  not 
available  because  of  the  coupling  effect  from  the  damping  coefficient  matrix  in 
the  differential  equation.  Numerical  methods  have  to  be  employed  to  offer  a 
solution.  Modal  analysis  transforms  the  continuous  system  to  an  infinite-degree- 
of-freedom  system  which  needs  to  be  truncated  for  numerical  calculations.  To 
have  more  accurate  solution,  the  more  modes  should  be  included. 
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The  study  of  control  system  design  shows  that  empolying  RMA  to  generate 
control  force  is  a  successful  approach  to  reduce  the  beam  vibration.  State  feed¬ 
back  control  system  design  br-sed  on  linear  optimal  control  theory  improves  the 
performance  of  the  cantilever  beajn-RMA  system.  The  resonant  phenomenon  is 
suppressed  to  acceptable  level.  A  two-posiilon  feedback  loop  is  good  enough  for 
reducing  the  vibration  of  the  first  two  modes.  The  optimal  feedback  gains  are  sug¬ 
gested  to  this  model.  The  closed-loop  control  system  also  gaurantees  the  ability 
to  compensate  the  parameter  cha^.ges  in  the  modeling  process.  A  linear  full-order 
estimator  is  designed  for  the  control  system  in  the  case  that  the  complete  state 
is  not  available  for  feedback  or  can  not  be  accurately  measured. 

Parameter  optimization  for  the  PVA  to  have  best  frequency  response  is  in 
progress.  The  study  in  this  thesis  provides  theoritical  understanding  for  experi¬ 
mental  implimentation  which  is  a  task  in  the  future. 
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APPENDIX 


Program  Listing 


The  program  EIGMAT  is  written  to  solve  for  the  eigen¬ 
values  and  calculate  the  coefficient  matrices  of  the  state  • 


differential  equation  for  case  study. 

00100  PROGRAM  EIGMAT 

00110  REAL  FV(25)  ,  NF(25,20),  EV(20),  EW(20),  AN(20), 

00115+  C21(20),  MASS(20,20), 

00120+  DAMP(20,20),  STIFF { 2 0 , 2 0 ) ,  MS(20),  BB(20), 

00125+  RXX(20, 20) ,MSS(25,25) , 

00130+  W(20),  CND(20),  MSD(20) 

00140  EXTEP  AL  FCN , SHSH, SHCH, CHCH , SHS , SHC , CHS , CHC , SS , 
00145+  SC,CC 

00150  COMMON  /BLOCKl/EV  ,/BLOCK2/ ALPHA,  MU,  F,  /BLOCK3/ 
00160+  N3,  AN,  C21,  ZA,MS 

00170C  PROGRAM  IS  SET  FOR  SINGLE  TUNING  Fa  CASE.  FOR 
00180C  MULTI-TUNING  CASE,  MODIFY  LINE  590.  ALPHA=POINT 
00185C  MASS  M/MASS  OF  THE  BEAM,  MU=MA/M,  ZA=  C/(2*M*WB), 
00190C  WB=  (EI/DAL4) **0.5,  F=(WA/WB),  WA= (K/MA) **0 . 5 
00200C  MSD  IS  THE  NORMALIZED  MODE  SHAPE,  MS=MSD/CN 
00210 

00220  ALPHA=0.25 

00230  MU  =  1.0 

00240  ZA  =  0.5 

00250  N3  =5 

00260  N4  =  4*N3  +  1 

00270  DF  =  1.8 

00280  FMAX  =  1.8 

00290  N1  =  INT(FMAX/DF)  +  1 

00300  XO  =  0.001 

00310  DX=0.1 

00320  XMAX=15. 

00330  N2=INT(XMAX/DX) 

00340  XTOL  =  0.0001 
00350  YTOL  =  0.0001 
00360  NLIM  =  10 
00370  DO  10  1=  1,  1 

00380  FV(I)  =  L 

00390  DO  10  J=  1,  20 

00400  NF(I,J)  =  0 
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00410  EV(J)  =  0 

00420  AN(J)  =  0 

00430  C21(J)  =  0 

00440  MS(J)  =  0 

00450  BB(J)  =  0 

00460  W(J)  =  0 

00470  CND{J)  =  0 

00480  MSD(J)  =  0 

00490  10  CONTINUE 
00500 

00510  DO  20  1=  1,20 
00520  DO  20  J=  1,20 
00530  MASS(I,J)=  0. 

00540  DAMP(I,J)=  0. 

00550  STIFF(I,J)=  0. 

00560  RXX(I,J)=  0. 

00570  20  CONTINUE 
00580 

00590  DO  100  1=  N1,N1 

00600  K  =  1 

00610  F  =  DF  *  (I-l) 

00620  FV(I)  =  F 
00630 

00640  DO  100  J  =  1,  N2 
00650 

00660  XI  =  DX*  (J-1)  +  XO 

00670  Y1  =  FCN(Xl) 

00680  X2  =  DX*  J  +  XO 

00690  Y2  =  FCN(X2) 

00700  IF  (  Y1*Y2  .GT.  0  )  GO  TO  100 
00710 

00720  II  =  1 

00730  CALL  MDLNIN  ( FCN , XI , X2 , XR, XTOL, YTOL, NLIM,  1 1 ) 

00740  NF(I,K)  =  XR 
00750  K=K+1 
00760 

00770  100  CONTINUE 
00780 

00790  PRINT  110 

00800  110  FORMAT  (5(/),  5X,  7HFa  , 14H  EIGENVALUES *L) 

00810  DO  130  1=  1,N1 

00820  PRINT  120,  FV(I),  (  NF(I,K),  K=  1,  6) 

00830  120  FORMAT  (/,  7E10.4) 

00840  130  CONTINUE 
00850 

00860C  CALCULATING  THE  COEFFICIENT  MATRICES 
00870  DO  200  1=  1,  N3 
00880  EV(I)  =  NF(N1,I) 
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00890  AN(I)  =  (F**2)/(F**2-EV(I) **4) 

00900  C21{I)  =  -(COSH(EV(I))+COS(EV(I)))/(SINH(EV(I))+SIN 

00905+  (EV(I))) 

00910  DO  150  J  =  1,  N4 

00920  EW(J)  =  (EV(I)/(N4-1)  )  *  (J-1) 

0093  0  MSS(I,  J)  =  (COSH (EW(J)  ) -COS  (EW(J)  )  ) +C21  (I)  *  (SINK 

00935+  (EW(J)  )-SIN(EW(J)  )  ) 

00940  150  CONTINUE 
00950  MS(I)  ==  MSS(I,N4) 

00960  200  CONTINUE 

00970  DO  202  1=  1,  N3 

00980  DO  202  J=  1,  N3 

00990  KXX(I,J)  =  MS(I)  *  MS(J) 

01000  202  CONTINUE 
01010 

01020  CALL  MASSM  (SHSH, SHCH, CHCH, SHS , SHC, CHS , CHC, SS , SC, CC, 
01025+  MASS  ) 

01030  CALL  STIFFM  ( SHSH , SHCH , CHCH , SHS , SHC , CHS , CHC , SS , SC , CC 
01035+  ,  STIFF  ) 

01040  DO  400  1=  1,  N3 

01050  CND(I)  =  {1/MASS(I,I)  )  **0.5 

01060  W(I)  =  STIFF(I,I)/(MASS(I,I) ) 

01070  DO  430  J-  1,  N4 

01080  MSS(I,J)  =  CND(I)*MSS(I,J) 

01090  430  CONTINUE 

01100  MSD(I)  =  MSS(I,N4) 

OHIO  BB(I)  =  (  1  -  AN(I))*  MSD(I) 

01120  400  CONTINUE 

01130  CALL  DAMPM  (  MSD,  DAMP  ) 

01140 

01150  PRINT  205 

01160  205  FORMAT (5 (/) ,45HMAGNITUDES  OF  NORMALIZED  MODE 
01170+  SHAPES  FROM  X=0,42H  TO  L,  DX/L  =0.05  WHICH  ALSO 
01175+  IS  MATRIX  G.) 

01180  DO  208  I  =  1,  N3 

01190  PRINT  280  ,  (MSS  (I, J) , J=1,N4) 

01200  208  CONTINUE 
01210  PRINT  410 

01220  410  FORMAT (3 (/) ,21HNCOEFFICIENT  MATRIX  Q  ) 

01230C  PRINT  280,  (MSD(I),  1=  1,  N3) 

01240  PRINT  420 

01250  420  FORMAT (3 (/), 30HDIAGONAL  TERMS  OF  MATRIX  OMEGA  ) 
01260  PRINT  280,  (W(I),  1=  1,  N3) 

01270C  PRINT  210 

01280  210  FORMAT(3(/),  IIHMASS  MATRIX) 

01290C  DO  220  1=  1,  N3 

01300C  PRINT  280,  (MASS(I,J),  J=1,N3) 

01310C  220  CONTINUE 
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01320  PRINT  230 

01330  230  FORMAT ( 5 (/), 16HDAMPING  MATRIX  C) 

01340  DO  240  1=  1,  N3 

01350  PRINT  280,  (DAMP(I,J),  J=1,N3) 

01360  240  CONTINUE 
01370C  PRINT  250 

01380C  250  FORMAT(5(/) , 16HSTIFFNESS  MATRIX) 

01390C  DO  260  1=1, N3 

01400C  PRINT  280,  (STIFF(I,J),  J=1,N3) 

01410C  260  CONTINUE 

01420  280  FORMAT(/,  7(2X,E10.4)) 

01430C  PRINT  290 

01440  290  FORMAT  (5(/),28HRXX  OF  THE  PERFORMANCE  INDEX) 

01450C  DO  295  I  =  1,  N3 

01460C  PRINT  280,  (RXX(I,J),  J=  1,  N3) 

01470C  295  CONTINUE 

01480  PRINT  300 

01490  300  FORMAT(5 (/) , 31HTHE  COEFFICIENT  MATRIX  B  ) 
01500  PRINT  280,  (  BB(I),  1=  1,  N3) 

01510  STOP 
01520  END 
01530 

01540  SUBROUTINE  MDLNIN  ( FCN, XI , X2 , XR, XTOL, FTOL, NLIM, I ) 
01550  LOGICAL  PRIN 
01560  PRIN=.TRUE. 

01570  IF  (I  .NE.  0)  PRIN  =  .FALSE. 

01580  F1=FCN(X1) 

01590  F2=FCN(X2) 

01600  IF  (F1*F2  .GT.  0)  GO  TO  50 

01610  FSAVE=F1 

01620  DO  20  J=  1,NLIM 

01630  XR=X2-F2* (X2-X1)/ (F2-F1) 

01640  FR=FCN(XR) 

01650  XERR=ABS (Xl-X2)/2 . 

01660  IF(.NOT.  PRIN)  GO  TO  5 
01670  PRINT  199,  J,  XR,  FR 

01680  199  FORMAT (IH, 13 HAT  ITERATION  ,  14,  5H  X  =  ,E12.5, 
01690+  9H,  F(X)  =  ,E12.5) 

01700  5  IF  (XERR  .LE.  XTOL)  GO  TO  60 
01710  IF  (ABS(FR)  .LT.  FTOL)  GO  TO  70 

01720  IF  (FR*F1  .LT.  0)  GO  TO  10 

01730  XI  =  XR 
01740  FI  =  FR 

01750  IF  (FR+FSAVE  .GT.  0)  F2  =  F2/2 

01760  FSAVE  =  FR 

01770  GO  TO  20 

01780  10  X2  =  XR 

01790  F2  =  FR 
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01800  IF  (FR*FSAVE  .GT.  0)  FI  =  Fl/2 

01810  FSAVE  =  FR 

01820  20  CONTINUE 
01830 

01  40  I  =  -1 

01.50  PRINT  200,  NLIM,  XR,  FR 

01860  200  FORMAT  (IHO,  26HTOLERANCE  NOT  MET.  AFTER  ,  14, 
01870+  15H  ITERATIONS  X  =  ,  E12.5,  12H  AND  F(X)  =  , 

01875+  E12.5) 

01880  RETURN 
01890  50  I  =-2 
01900  PRINT  201 

01910  201  FORMAT  (IHO,  35HFUNCTION  HAS  SAME  SIGN  AT  XI 
01915+  AND  X2) 

01920  RETURN 
01930 

01940  60  I  =  1 

01950  PRINT  202,  J,  XR,  FR 

01960  202  FORMAT  (IHO,  19HX  TOLERATNCE  MET  IN  ,14, 

01965+  18H  ITERATIONS.  X  =  , 

01970+  E12.5,  8H  F(X)  =  ,  E12.5) 

01980  RETURN 

01990  70  I  =  2 

02000  PRINT  203,  J,  XR,  FR 

02010  203  FORMAT  (  IHO,  19HF  TOLERANCE  MET  IN  ,  14, 

02015+  18H  ITERATIONS.  X  =  , 

02020+  E12.5,  8H  F(X)  =  ,  E12.5) 

02030  RETURN 
02040  END 
02050 

02060  SUBROUTINE  MASSM  (SHSH,SHCH, CHCH, SHS , SHC, CHS , CHC, SS , 
02065+  SC,CC,M) 

02070  REALA(20),  M(20,20),  C(20),  MS(20) 

02080  COMMON  /BLOCK2/  AL,  MU,  F,  /BLOCK3/  N,  A,  C,  ZA,  MS 
02090 

02100  DO  100  I  =  1,  N 
02110  DO  100  J=  1,  N 

02120  M(I,J)  =  CHCH(I,J)  -  CHC(I,J)  -  CHC(J,I)  + 

02130+  CC(I,J)+C(I)*(  SHCH(I,J)  -  SHC(I,J) 

02135+  -  CHS(J,I)  +SC(I,J)  ) 

02140+  +C(J)*(  SHCH(J,I)  -  SHC(J,I)  -  CHS(I,J)  + 

02145+  SC(J,I)  ) 

02150+  +C(I)*C(J)*(  SHSH(I,J)  -  SHS(I,J)  - 

02155+  SHS(J,I)  +  SS(I,J)  ) 

02160+  +AL*  MS(I)*  MS(J)*  ( 1+MU*  ( I )  *A  ( J)  ) 

02170  100  CONTINUE 
02180  RETURN 
02190  END 
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MATRIXx  Input  File 

*****  This  is  for  open-loop  system  with  damping.  ***** 
*****  For  undamped  system,  modify  matrix  A.  ***** 

alpha=0.25;  mu=1.0;  fa=l.8;  zeta=0.5; 

A  =  [  0  0  1  0  ; 

0  0  0  1  ; 

-154.42  48.275  -47.235  15.38  ; 

-459.15  143.41  -140.42  45.73]; 

B  =  [  0  0  .665  2.021]  '  ; 

C  =  [  0  1  0  0  ];  D=0; 

S  =  [A  B; 

C  D]  ; 

NS=4;  NR=[0.01;5;300] ; 

[0ME,H]  =  FREQ (S, NS, NR) ; 

XI  =  ABS (H) ; 

TMAX  =  15;  NPTS  =  300;  XO  =  [0  .1  0  0]'; 

[T,X3]  =  IMPULSE (S, NS, TMAX, NPTS) ; 

[T,XI1]  =  INITIAL(S,NS, XO, TMAX, NPTS ) ; 

*****  Optimal  LQG  regulator  ***** 

QC  =  [  0  0  .677  2.014] '  ; 

RXX  =  DIAG([0  1  0  0]);  RUU  =  lE-3 ; 

[EVAL,K]  =  REGULATOR (A, QC, RXX, RUU) 

AK  =  A  -  QC*K; 

S  =  [AK  B; 

C  D]  ; 

[OME,H]  =  FREQ (S, NS, NR) ; 

X2  =  ABS(H) ; 

[T,X4]  =  IMPULSE (S, NS, TMAX, NPTS ) ; 

X  =  [XI  X2] ; 

XIM  =  [X3  X4]; 

*****  Estimator  design  ***** 

POLES  =  [-3.+  0.03*JAY,  -2 . +0 . 02 *JAY ] ; 

GE  =  POLEPLACE ( A ' , C ' , POLES ) 

AE  =  A  -  GE'*C  ; 

S  =  [  AE  GE' ; 

C  D  ]  ; 

XO  =  [0  -.5  0  0]';  DELTAT  =  .05; 

[T,XI2]  =  LSIM(S,NS,XI1, DELTAT, XO) ; 

XI  =  [XII  XI2] ; 

*****  Output  feedback  control  system  ***** 

AO  =  [  A  -QC*K  ; 

GE'*C  AE-QC*K  ] 

BO  =  [  B  ;  0  ;  0  ;  0  ;  0 ]  ; 

CO  =  [  C  0  0  0  0  ]; 

SO  =  [  AO  BO  ; 

CO  D  ]  ; 

NS  =  8  ; 
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[OME,H]  =  FREQ (SO, NS, NR) ; 

XO  =  ABS  (H)  ; 

XO  =  [XI  XO]  ; 

TMAX  =15.  ;  NPTS  =  300  ; 

[T,X5]  =  IMPULSE (SO, NS, TMAX, NPTS ) ; 
XIO  =  [X3  X5]; 
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